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BACKGROUND OF THE INV ENTION 



Field of the Invention 

The present invention relates to the screening of drugs, especially random peptides, in 
yeast cells for the ability to interact with proteins involved in the post-trans lational 
modification, transport of and response to yeast pheromones or substitutes therefor. 

Description of the Background Art 

Drug Screening 

The identification of biological activity in new molecules has historically been 
accomplished through the use of in vitro assays or whole animals. Intact biological entities, 
either cells or whole organisms, have been used to screen for anti-bacterial, anti-fungal, anti- 
parasitic and anti-viral agents in vitro. Cultured mammalian cells have also been used in 
screens designed to detect potential therapeutic compounds. A variety of bioassay endpoints 
are exploited in mammalian cell screens including the stimulation of growth or differentiation 
of cells, changes in cell motility, the production of particular metabolites, the expression of 
specific proteins within cells, altered protein function, and altered conductance properties. 
Cytotoxic compounds used in cancer chemotherapy have been identified through their ability 
to inhibit the growth of tumor cells in vitro and in vivo. In addition to cultures of dispersed 
cells, w hole tissues have served in bioassays, as in those based on the contractility of muscle. 
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vitro testing is a preferred methodology in that it permits the design of high- 
throughput screens: small quantities of large numbers of compounds can be tested in a short 
period of time and at low expense. Optimally, animals are reserved for the latter stages of 
compound evaluation and are not used in the discovery phase; the use of whole animals is 
labor-intensive and extremely expensive. 

Microorganisms, to a much greater extent than mammalian cells and tissues, can be 
easily exploited for use in rapid drug screens. Yeast provide a particularly attractive test 
system; extensive analysis of this organism has revealed the conservation of structure and 
function of a variety of proteins active in basic cellular processes in both yeast and higher 
eukaryotes. 

The search for agonists and antagonists of cellular receptors has been an intense area 
of research aimed at drug discovery due to the elegant specificity of these molecular targets. 
Drug screening has been carried out using whole cells expressing functional receptors and, 
recently, binding assays employing membrane fractions or purified receptors have been 
designed to screen compound libraries for competitive ligands. Duke University, 
WO92/05244 (April 2, 1992) describes the expression of mammalian G protein-coupled 
receptors in yeast and a means of identify ing agonists and antagonists of those receptors using 
that organism. 

In addition, yeast are, of course, used in the discovery of antifungal compounds; 
Etienne et al. (1990) describe the use of Saccharomyces cerexisiae mutant strains, made 
highly sensitive to a large range of antibiotics, for the rapid detection of antifungals. 
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)'easi Pheromone System Proteins and Their Metabolic Function 1 — 

Haploid yeast cells are able not only to grow vegetatively, but also to mate to form a 
diploid cell. The two mating types ("sexes") of haploid cells are designated a and a. The a 
cells produce the dodecapeptide a-factor, and the a cells, the tridecapeptide a-factor. 
Because a-factor and a-factor elicit a mating response in the yeast cell of the opposite "sex", 
they are called "pheromones". These pheromones, as well as other proteins specifically 
involved in the production or transport of, or response to, pheromones, are considered 
"pheromone system proteins". 
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The gene encoding a-factor pheromone, like the a-factor receptor gene, is an a cell- 
specific gene; a cell-specific genes are only expressed in a cells. The gene encoding a-factor 
pheromone, like the a-factor receptor gene, is an a cell-specific gene; a cell-specific genes 
are only expressed in a cells. Other yeast genes belong to a haploid-specific gene set and are 
expressed in haploid cells (a cells or a cells) but not in diploid (a a) cells. In addition, there 
exists a diploid cell-specific gene set, including those genes involved in sporulation. 

In eukaryotic cells, RNA polymerase II promoters contain a specific sequence (the 
TATA box) to which the transcription factor TFIID (TATA binding protein or TBP) binds. 
An active transcription initiation complex includes TFIID, accessory initiation proteins, and 
RNA Pol II. As in higher eukaryotic cells, the TATA box is an essential control sequence in 
yeast promoters. Yeast TATA-box-binding protein (TBP) was identified by its ability to 
substitute in function for mammalian TFIID [Buratowski et al.. Nature 334, 37 (1988); 
Cavallini et al., Nature 334, 77 (1988)]. With only a few apparent exceptions [transcription 
of some glycolytic enzyme genes, see Struhl, Mol. Cell. Biol. 6, 3847 (1986) and Ogden et 
al., Mol. Cell Biol. 6, 4335 (1986)] transcription of yeast genes requires the proximal TATA 
box element and TFIID binding for initiation of transcription. Also required for efficient 
transcription are gene-specific activator proteins; the precise mechanism whereby these gene- 
specific regulatory proteins influence transcription has not been completely elucidated. 

MCMlp (encoded in the MCMI gene) is a non-cell-type-speciflc transcription factor 
in yeast. MCMlp acts alone or in concert with other regulatory proteins to control expression 
of a- and a- cell specific genes. Yeast mating type loci encode the regulatory proteins that 
contribute to the control of cell type-specific expression. These proteins are Matalp 
(encoded by the MATa gene) and Matalp and Mata2p (encoded by the MATa locus). 
MCMlp activates transcription of a-specific genes by binding to an upstream activation 
sequence (L'AS) located in the control region of a-specific genes. Matalp and MCMlp 
interact to enhance each other's binding to specific UAS binding sites to activate a-cell- 
specific gene transcription in a-cells. Mata2p associates with MCMlp to repress a-speciflc 
gene transcription in a-cells. In diploid (a/a) cells, Matalp and Mata2p associate to repress 
the transcription of haploid-specific genes. The Matalp'Mata2p regulatory entity is found 
only in diploid cells. 



Yeast contain two genes encoding the a-factor pheromone, MFotl and MFa2. 
Analysis of yeast bearing mutations in these sequences indicates that MFoxl gives rise to the 
majority of a-factor produced by cells. Expression occurs at a higher level from MFal than 
from \1Fci2 (Kurjan, Mol. Cell. Biol. 5, 787 ( 1985). 

The MFal gene of yeast encodes a 165 aa precursor protein containing an 85 aa leader 
sequence at the N-terminus. The leader includes a 19 aa signal sequence and a 66 aa 
sequence which contains sites for the addition of three oligosaccharide side chains (Kurjan 
and Herskowitz, Cell 39. 933 (1982); Singh et al. Nuc. Acids Res. 11, 4049 (1983); Julius et 
al. Cell 36, 309 (1984). Four tandem copies of the 13 aa a-factor are present in the C- 
terminal portion of the precursor; 6-8 aa spacer peptides precede the a-factor sequences (see 
Figure 2). 

After translocation of the nascent a-factor polypeptide to the ER, the signal sequence 
is cleaved from the precursor protein to yield pro-a-factor (Waters et al. J. Biol. Chem. 263, 
6209 ( 1988 ). The core N-Iinked carbohydrate is added to three sites in the N-terminus of pro- 
a-factor (Emter et al. Biochem. Biophys. Res. Commun. 116, 822 (1983); Julius et al. Cell 
36, 309 (1984); Julius et al. Cell 37, 1075 (1984). Additional glycosylation occurs in the 
Golgi prior to cleavage of pro-a-factor by the KEX2 endopeptidase. This enzyme cleaves 
within each of the spacer repeats leaving a Lys-Arg sequence attached to the C-terminus of a- 
factor peptide (Julius et al. Cell 37, 1075 (1984). The Lys-Arg sequence is removed by the 
action of the KEX-1 protease (Dmochowska et al. Cell 50, 573 ( 1987). The additional spacer 
residues present at the N-terminus of a-factor peptide are removed by the dipeptidyl 
aminopeptidase encoded by STE13 (Julius et al. Cell 32, 839 (1983). Four a-factor peptides 
are released from each precursor protein via the proteolytic processing outlined above and the 
mature a-factor is secreted from the cell. 

Precursors of the 12 aa mature a-factor peptide are encoded in the MFal and MFa2 
genes and are 36 aa and 38 aa residues, respectively (for schematic of MFal gene see Figure 
5). The precursors contain one copy of a-factor and the products of the two genes differ in 
sequence at one amino acid. The two forms of a-factor are produced in equal amounts by a 
cells (Manney et al. in Sexual interactions in eukatyotic microbes, p21. Academic Press, New 
York (1981 >. 



Processing of a-factor entails a process that differs in even detail from that of a- 
factor. The processing of a-factor begins in the cytosol and involves the farnesylation of the 
C-terminal cysteine residue near the carboxyl terminus (-CVIA) by a farnesyl transferase 
(Schafer et al. Science 245, 379 (1989); Schafer et al. Science 249, 1 133 (1990). The a and p 
subunits of the farnesyl transferase are encoded by the RAM2 and RAMI genes, respectively 
(He et al. Proc. Natl. Acad. Sci. 88, 11373 (1991). Subsequent to farnesylation is the 
proteolytic removal of the three amino acids that are C-terminal to the modified cysteine by a 
membrane-bound endoprotease. Next, the carboxy-terminal farnesylated cysteine residue is 
modified further: the carboxyl group is methylated by the product of the STE14 gene. 
STE14p is a membrane-bound S-farnesyl-cysteine carboxyl methyl transferase (Hrycyna et al. 
EMBO. J. 10, 1699 (1991). The mechanisms of the N-terminal processing of a-factor have 
not been elucidated. After processing of the precursors is complete, mature a-factor is 
transported to the extracellular space by the product of the STE6 gene (Kuchler et. al. EMBO 
J. 8, 3973 (1989), an ATP-binding cassette (ABC) transporter. 

In normal S. cerevisiae (budding yeast) a cells, the a- factor binds the G protein- 
coupled membrane receptor STE2. The G protein dissociates into the G a and G p , subunits, 
and the G py binds an unidentified effector, which in turn activates a number of genes. STE20, 
a kinase, activates STE5, a protein of unknown function. STE5 activates STE11 kinase, 
which stimulates STE7 kinase, which induces the KSS1 and/or FUS3 kinases. These switch 
on expression of the transcription factor STE12 . STE12 stimulates expression of a wide 
variety of genes involved in mating, including FUS1 (cell fusion), FAR 1 (cell-cycle arrest), 
STE2 (the receptor), MFA 1 (the pheromone), SST2 (recovery), KAR3 (nuclear fusion) and 
STE6 (pheromone secretion). Other genes activated by the pathway are CHS1 , AGal, and 
KAR3. The multiply tandem sequence TGAAACA has been recognized as a "pheromone 
response element" found in the 5'-flanking regions of many of the genes of this pathway. 

One of the responses to mating pheromone is the transient arrest of the yeast cell in 
the Gl phase of the cell cycle. This requires that all three Gl cyclins (CLN1, CLN2, CLN3) 
be inactivated. It is believed that FUS3 inactivates CLN3, and FAR1 inhibits CLN2. (The 
product responsible for inactivating CLN1 is unknown). 
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The growth arrest is terminated by a number of different mechanisms. First, the a- - 
factor receptor is internalized following binding of the pheromone, resulting in a transient 
decrease in the number of pheromone binding sites. Second, the C-terminal tail of the 
receptor is phosphorylated consequent to ligand binding, resulting in uncoupling of the 
receptor from the transducing G proteins. Third, pheromone-induced increases in expression 
of GPAlp (the Ga-subunit of the heterotrimeric G protein) increase the level of the a subunit 
relative to the G p and G, subunits, resulting in reduction in the level of free G p , and 
consequent inactivation of the pheromone response pathway. Additional mechanisms include 
induction of the expression of SST2 and BAR1 and phosphorylation of the a subunit (perhaps 
by SVG 1). 

Signaling is inhibited by expression of a number of genes, including CDC36, CDC39, 
CDC72 , CDC73 , and SRM 1 . Inactivation of these genes leads to activation of the signaling 
pathway. 

A similar pheromone signaling pathway may be discerned in a cells, but the 
nomenclature is different in some cases (e.g., STE3 instead of STE2). 

Other yeast also have G protein-mediated mating factor response pathways. For 
example, in the Fission yeast S. pombe , the M factor binds the MAP3 receptor, or the P-factor 
the MAM2 receptor. The dissociation of the G protein activates a kinase cascade (BYR2, 
BYR1, SPK1), which in turn stimulates a transcrip-tion factor (STE11). However, in S^ 
pombe , the Get subunit transmits the signal, and there are of course other differences in detail. 
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Pheromone Pathway Mutants 

The effects of spontaneous and induced mutations in pheromone pathway genes have 
been studied. These include the a-factor (MFal and MFot2) genes, see Kurjan, Mol. Cell. 
Biol., 5:787 (1985); the a-factor (MFal and MFa2) genes, see Michaelis and Herskowitz, 
Mol. Cell. Biol. 8: 1309 ( 1988); the pheromone receptor (STE2 and STE3 ) genes, see Mackay 
and Manney, Genetics, 76;273 (1974), Hartwell, J. Cell. Biol., 85:81 1 (1980), Hagen, et al., 
P.N.A.S. (USA), 83:1418 (1986): the FAR1 gene, see Chang and Herskowitz, Cell, 63:999 
(1990); and the SST2 gene, see Chan and Otte, Mol. Cell. Biol., 2:1 1 (1982). 

r"-~ 
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A wide variety of foreign proteins have been produced in S. cerevisiae, either solely in 
the yeast c>1oplasm or through exploitation of the yeast secretory pathway (Kingsman et al. 
TIBTECH 5, 53 ( 1987). These proteins include, as examples, insulin-like growth factor 
receptor (Steube et al. Eur. J. Biochem. 198, 65 1 (1991), influenza virus hemagglutinin 
(Jabbar et al. Proc. Natl. Acad. Sci. 82, 20 19 (1985), rat liver cytochrome P-450 (Oeda et al. 
DNA 4, 203 (1985) and functional mammalian antibodies (Wood et al. Nature 314, 446 
(1985). Use of the yeast secretory pathway is preferred since it increases the likelihood of 
achiev ing faithful folding, glycosylation and stability of the foreign protein. Thus, expression 
of heterologous proteins in yeast often involves fusion of the signal sequences encoded in the 
genes of yeast secretory proteins (e.g., a-factor pheromone or the SUC2 [invertase] gene) to 
the coding region of foreign protein genes. 

A number of yeast expression v ectors have been designed to permit the constitutive or 
regulated expression of foreign proteins. Constitutive promoters are derived from highly 
expressed genes such as those encoding metabolic enzymes like phosphoglycerate kinase 
(PGK1) or alcohol dehydrogenase I (ADH1) and regulatable promoters have been derived 
from a number of genes including the galactokinase (GAL I ) gene. In addition, supersecreting 
yeast mutants can be derived; these strains secrete mammalian proteins more efficiently and 
are used as "production" strains to generate large quantities of biologically active mammalian 
proteins in yeast (Moir and Davidow, Meth. in Enzymol. 1 94, 491 ( 1 99 1 ). 

A variety of heterologous proteins have been expressed in yeast cells as a means of 
generating the quantity of protein required for commercial use or for biochemical study 
(Kingsman et al. TIBTECH 5, 53 ( 1987). In addition, a number of mammalian proteins have 
been expressed in yeast in order to determine whether the proteins (1) will functionally 
substitute for cognate proteins normally expressed within that organism or (2) will interact 
with accessory yeast proteins to accomplish a specific function. Thus it has been determined 
that a human TBP with altered binding specificity will function to initiate transcription in 
yeast [Strubin and Struhl, Cell 68, 721 (1992)]. In addition, mammalian steroid hormone 
receptors [Metzger et al. (1988); Schena and Yamamoto (1988)] and human p53 [Scharer and 
Iggo, Nuc. Acids Res. 20, 1539 ( 1992)] were shown to activate transcription in yeast. 

Expression in yeast of the gag-pol gene of HIV-1 results in the processing of the gag 
protein precursor to yield the products which normally arise within the virion; processing in 
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yeast, as in the virus, is due to the action of the protease encoded within the gag-pol gene 
(Kramer et al. Science 231,1580 ( 1986). 

A number of mammalian ABC transporters have been expressed in yeast to determine 
their ability to substitute for yeast Ste6p in the transport of pheromone. The mammalian 
proteins thus far tested include human Mdrl (Kuchler and Thorner, Proc. Natl. Acad. Sci. 89. 
2302 (1992)) and murine Mdr3 (Raymond et al. Science 256, 232 (1992), proteins involved 
in multidrug resistance; in addition, a chimeric protein containing human CFTR (cystic 
fibrosis transmembrane conductance regulator) and yeast STE6 sequence has been shown to 
transport a-factor pheromone in yeast (Teem et al. Cell 73, 335 (1993). 

An a cell may be engineered to produce the a-factor receptor, and an a cell to make DC- 
factor receptor. Nakayama, et al., EMBO J., 6:249-54 (1987); Bender and Sprague, Jr., 
Genetics 121: 463-76 ( 1989). 

Heterologous G protein-coupled receptors have been functionally expressed in Si 
cerevisiae . Marsh and Hershkowitz, Cold Spring Harbor Symp., Quant. Biol., 53: 557-65 
(1988) replaced the S. cerevisiae STE2 with its homologue from S. Kluyven . More 
dramatically, a mammalian beta-adrenergic receptor and Ga subunit have been expressed in 
yeast and found to control the yeast mating signal pathway. King, et al., Science, 250: 121- 
123 (1990). 

Duke University, WO92/05244 (April 2, 1992) describes a transformed yeast cell 
which is incapable of producing a yeast G protein a subunit, but which has been engineered 
to produce both a mammalian G protein a subunit and a mammalian receptor which is 
"coupled to" (i.e., interacts with) the aforementioned mammalian G protein a subunit. 
Specifically, Duke reports expression of the human beta-2 adrenergic receptor (hpAR), a 
seven transmembrane receptor (STR), in yeast, under control of the GAL1 promoter, with the 
hpAR gene modified by replacing the first 63 base pairs of coding sequence with 1 1 base 
pairs of noncoding and 42 base pairs of coding sequence from the STE2 gene. (STE2 
encodes the yeast a-factor receptor). Duke found that the modified hpAR was functionally 
integrated into the membrane, as shown by studies of the ability' of isolated membranes to 
interact properly with various known agonists and antagonists of h(3AR. The ligand binding 
affinity for yeast-expressed hpAR was said to be nearly identical to that observed for 
naturally produced hpAR. 
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Duke co-expressed a rat G protein a subunit in the same cells, yeast strain 8C, which 
lacks the cognate yeast protein. Ligand binding resulted in G protein-mediated signal 
transduction. 

Duke teaches that these cells may be used in screening compounds for the ability to 
affect the rate of dissociation of Get from Gpy in a cell. For this purpose, the cell further 
contains a pheromone-responsive promoter (e.g. BAR1 or FUS1 ), linked to an indicator gene 
(e.g. HIS3 or LacZ ). The cells are placed in multi-titer plates, and different compounds are 
placed in each well. The colonies are then scored for expression of the indicator gene. 

Duke's yeast cells do not, however, actually produce the compounds to be screened. 
As a result, only a relatively small number of compounds can be screened, since the scientist 
must ensure that a given group of cells is contacted w ith only a single, known compound. 

Yeast have been engineered to express foreign polypeptide variants to be tested as 
potential antagonists of mammalian receptors. Libraries encoding mutant glucagon 
molecules were generated through random misincorporation of nucleotides during synthesis 
of oligonucleotides containing the coding sequence of mammalian glucagon. These libraries 
were expressed in yeast and culture broths from transformed cells were used in testing for 
antagonist activity on glucagon receptors present in rat hepatocyte membranes (Smith et al. 
1993). 

Drugs which overcome the multiple drug resistance (MDR) of cancer cells may be 
identified by using transformed yeast cells expressing P-glycoprotein (Suntory Ltd., patent 
application JP 2257873 entitled "Multiple drug resistance-relating gene- comprises P- 
glycoprotein accumulated in cell membrane part of transformed yeast"). The drugs were not 
produced by the yeast cells in question. 

A yeast strain has been derived to allow the identification of inhibitors of protein 
farnesyl transferase which exhibit activity against mammalian Ras and which may therefore 
function as antitumor drugs (Hara et al. 1993 ). 

| Deleted : \ 

Genetic Markers in Yeast Strains 

Yeast strains that are auxotrophic for histidine ( H1S3 ) are known, see Struhl and Hill), 
Mol. Cell. Biol.. 7:104 (1987); Fasullo and Davis, Mol. Cell. Biol., 8:4370 ( 1988). The H1S3 
(imidazoleglycerol phosphate dehydratase) gene has been used as a selective marker in yeast. 
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See Sikorski and Heiter, Genetics. 122:19 (1989); Struhl, et aL P.N.A.S. 76:1035 (1979): 
and, for FUS1-HIS3 fusions, see Stevenson, et aL Genes Dev.. 6:1293 ( 1992). 

peptide Libraries. Peptide libraries are systems which simultaneously display, in a form 
which permits interaction with a target, a highly diverse and numerous collection of peptides. 
These peptides may be presented in solution (Houghten), or on beads (Lam), chips (Fodor), 
bacteria (Ladner USP 5,223,409), spores (Ladner USP 409), plasmids (Cull) or on phage 
(Scott, Devlin, Cwirla, Felici, Ladner '409). Many of these systems are limited in terms of the 
maximum length of the peptide or the composition of the peptide (e.g., Cys excluded). Steric 
factors, such as the proximity of a support, may interfere w ith binding. Usually, the screening 
is for binding in vitro to an artificially presented target, not for activation or inhibition of a 
cellular signal transduction pathway in a living cell. W hile a cell surface receptor may be 
used as a target, the screening will not reveal whether the binding of the peptide caused an 
allosteric change in the conformation of the receptor. 

Ladner, USP 5-,096,815 describes a method of identifying novel proteins or 
polypeptides with a desired DNA binding activity. Semi-random ("variegated") DNA 
encoding a large number of different potential binding proteins is introduced, in expressible 
form, into suitable host cells. The target DNA sequence is incorporated into a genetically 
engineered operon such that the binding of the protein or polypeptide will prevent expression 
of a gene product that is deleterious to the gene under selective conditions. Cells which 
survive the selective conditions are thus cells which express a protein which binds the target 
DNA. While it is taught that yeast cells may be used for testing, bacterial cells are preferred. 
The interactions between the protein and the target DNA occur only in the cell (and then only 
in the nucleus), not in the periplasm or cytoplasm, and the target is a nucleic acid, and not a 
pheromone system protein surrogate. 

Substitution of random peptide sequences for functional domains in cellular proteins 
permits some determination of the specific sequence requirements for the accomplishment of 
function. Though the details of the recognition phenomena which operate in the localization 
of proteins within cells remain largely unknown, the constraints on sequence variation of 
mitochondrial targeting sequences and protein secretion signal sequences have been 
elucidated using random peptides (Lemire et aL, J. Biol. Chem. 264, 20206 ( 1989) and Kaiser 
et al. Science 235. 312 (1987). respectively). 
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All references cited in this specification are hereby incorporated by reference. No | - — — — — 

admission is made that any reference constitutes prior art. 
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SUMMARY OF THE INVENTION 

In the present invention, a yeast cell is engineered to express an exogenous protein 
which is, however, capable of substituting for a yeast protein which is involved in the post- 
translational modification, transport, recognition or signal transduction of a yeast pheromone. 
sufficiently, to be able, at least under some circumstances, to carry out that role of the yeast 
protein. For the sake of convenience, these yeast proteins will be referred to as "pheromone 
system proteins" (PSP), and their cognate non-yeast proteins as PSP surrogates. 

The pheromone system of a yeast cell is thus subverted so that the response of the cell 
to a yeast pheromone (or a ligand of the yeast pheromone receptor) is at least partially 
determined by the activity of the surrogate PSP. In a preferred embodiment, the cognate yeast 
PSP is not produced in functional form, so that the response is essentially entirely dependent 
on the activity of the surrogate PSP. 

Such yeast cells may be used to identify drugs which inhibit or activate, to a 
detectable degree, the ability of the surrogate to substitute for the cognate yeast PSP. To 
screen for an inhibitor, a normally functional surrogate is expressed, and the presence of an 
inhibitor is indicated by a depression of the cellular response. To screen for an activator, a 
surrogate functional in yeast, or one normally not functional in yeast but which is activatable 
(the latter is preferred, to minimize background) is used, and the activator is detected through 
its elevation of the cellular response. 

In a preferred embodiment, the candidate drug is a peptide, and the yeast cell is 
engineered to express the candidate drug as well as the surrogate PSP. 

Another consideration is that with wild-type yeast cells, to achieve pheromone 
secretion and response, both a and a cells must be provided. In some preferred embodiments, 
a cells are engineered to express a-factor receptor or a cells are engineered to express a- 
factor receptor. These modified cells may be considered "autocrine" because they are "self- 
stimulatory". Cells which express both a surrogate for the pheromone receptor, and a 
heterologous peptide agonist for the surrogate receptor, are also considered "autocrine", 
because they will respond to the co-produced agonist. 
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The classes of PSPs and PSP surrogates are numerous. In one embodiment, the PSP 
surrogate is a surrogate for a PSP involved in the upstream processing of the pheromone prior 
to its interaction with the receptor. In sub-embodiments, the PSP may be one involved in the 
post-translational modification of the precursor protein to yield mature pheromone (e.g., 
proteases, carboxymethyltransferases or farnesyltransferases) or one involved in the secretion 
or transport of the pheromone (e.g., an ABC transporter). The pheromone itself, and its 
precursor, may also be considered an upstream PSP.) 

In another embodiment, the PSP surrogate is a surrogate for a G protein-coupled 
receptor. 

In a third embodiment, the PSP surrogate is a surrogate for a G protein or G protein 
subunt, especially the alpha subunit. 

In a fourth embodiment, the PSP surrogate is a surrogate for a PSP involved in the 
downstream transduction of a signal received by the pheromone receptor. Such PSPs include 
kinases and cyclins. Examples of these embodiments are discussed in more detail below. 
Farnesyltransferases and carboxymethyltransferases. After expression, a -factor is 
farnesylated by RAMlp and RAM2p and carboxym ethyl ated by Stel4p. These modifications 
are required for activity. 

RAMlp and RAM2p are homologous to the subunits of the heterodimeric mammalian 
farnesyltransferase, which itself is necessary for membrane association of mammalian Ras 
proteins. If a yeast cell is engineered to express the mammalian farnesyltransferase, it may be 
used to identify drugs which interact with that enzyme by determining whether a functional a- 
factor is produced. Similarly, Stel4p is homologous to mammalian 

carboxymethyltransferases, which play regulatory roles in controlling the function of low 
molecular weight G proteins (Ras, Rho, Rab). 

r ' 
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Proteases. The PSP may be a yeast protease, such as KEX2, STE13 or KEX1. Yeast a- 
factor pheromone is generated through the controlled and limited proteolysis of precursor 
proteins by these proteases. A yeast cell may be engineered to express an inactive precursor 
of yeast a-factor in which a peptide linker, corresponding to the cleavage site of a surrogate 
non-yeast protease, is incorporated so that cleavage will liberate mature a-factor (or its 
functional homologue). For example, the PSP surrogate may be HIV protease, with the 



cleavage site of HIV protease being substituted for the yeast protease cleavage sites in the cx- 
factor precursor. The precursor and the HIV protease are co-expressed in the yeast cell. 
Proteolysis by HIV protease will give rise to production of mature a-factor and initiation of 
signal transduction. This system may be used to identify' inhibitors of HIV protease. 

Preferably, unlike yeast cells occurring in nature, the yeast cell is engineered not only 
to express the a-factor precursor, but also the a-factor receptor, so that a single haploid ty pe 
of yeast is all that is required to conduct the assay. 

ABC Transporters Ste6 is the yeast ABC transporter necessary for the export of a-factor. 
The yeast cell is engineered to express both a-factor and a foreign ABC transporter. This 
transporter may be one which is not, by itself, able to transport a-factor, but which in the 
presence of a drug of interest, is capable of doing so. or it may be one which is already 
functional. 

Preferably, the yeast cell is engineered to express not only a-factor, but also the a- 
factor receptor. 

G Protein-Coupled Receptors. The PSP may be a yeast pheromone receptor. The surrogate 
is a non-yeast. G protein-coupled receptor. In order to achieve coupling to the pheromone 
signal transduction pathway, it may be necessary to provide a foreign or chimeric G a or G pv 
subunit. 

The engineered yeast cell may be used to screen for agonists as well as antagonists. 
When used to screen for agonists, it is preferable that the yeast pheromone not be produced in 
functional form. 

frotein Kinases. The PSP may be a protein kinase, such as the FUS1. KSS1. STE1 1 or STE7 
proteins, which participate in the cellular response to pheromones. The PSP surrogate would 
be, e.g., a mammalian, mitogen-activated protein kinase. Yeast cells engineered to express 
the surrogate protein kinase could be used to screen for activators or inhibitors thereof. 

Cyclins. The PSP may be a cyclin. such as CLN1, CLN2 or CLN3. The cyclins regulate the 
progression of cells through the cell cycle. The human C, Dl and E cyclins are capable of 
substituting functionally for the CLN proteins of yeast. Inhibitors of mammalian cyclins may 
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be useful in cancer chemotherapy. Yeast cells engineered to express a surrogate cyclin may 

be used to identity molecules which inhibit (or enhance) its activity. 
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Peptide Libraries 

While others have engineered yeast cells to facilitate screening of exogenous drugs as 
receptor agonists and antagonists, the cells did not themselves produce both the drugs and the 
receptors. Yeast cells engineered to produce the receptor, but that do not produce the drugs 
themselves, are inefficient. To utilize them one must bring a sufficient concentration of each 
drug into contact with a number of cells in order to detect whether or not the drug has an 
action. Therefore, a microtiter plate well or test tube must be used for each drug. The drug 
must be synthesized in advance and be sufficiently pure to judge its action on the yeast cells. 
When the yeast cell produces the drug, the effective concentration is higher. 

In a preferred embodiment, the yeast cells collectively produce a "peptide library", 
preferably including at least 10 7 different peptides, so that diverse peptides may be 
simultaneously assayed for the ability to interact with the PSP surrogate. 

In an especially preferred embodiment, at least some peptides of the peptide library 
are secreted into the periplasm, where they may interact with the "extracellular" binding 
site(s) of an exogenous receptor. They thus mimic more closely the clinical interaction of 
drugs with cellular receptors. This embodiment optionally may be further improved (in 
assays not requiring pheromone secretion) by preventing pheromone secretion, and thereby 
avoiding competition between the peptide and the pheromone for signal peptidase and other 
components of the secretion sy stem. 



JDetection of Signal Transduction 

Yeast cells may also be engineered so that their pheromone signal transduction 
pathways provide a more readily detectable evidence of the activity of a suspected drug. In 
these embodiments, the drug need not be a peptide produced by the same yeast cell, or even a 
peptide at all. 

As previously mentioned, one of the consequences of activation of the pheromone 
signal pathway in wild-type yeast is growth arrest. If one is testing for an antagonist of a G 
protein-coupled receptor, or of other pheromone system proteins, this normal response of 
growth arrest can be used to select cells in which the pheromone response pathway is 
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inhibited. That is. cells exposed to both a known agonist and a peptide of unknown activity 
will be growth arrested if the peptide is neutral or an agonist, but will grow normal ly if the 
peptide is an antagonist. Thus, the growth arrest response can be used to advantage to 
discover peptides that function as antagonists. 

However, when searching for peptides which can function as agonists of G protein- 
coupled receptors, or other pheromone system proteins, the growth arrest consequent to 
activation of the pheromone response pathway is an undesirable effect for this reason: cells 
that bind peptide agonists stop growing while surrounding cells that fail to bind peptides will 
continue to grow . The cells of interest, then, will be overgrown or their detection obscured by 
the background cells, confounding identification of the cells of interest. To overcome this 
problem the present invention teaches engineering the cell such that: 1 ) growth arrest does 
not occur as a result of pheromone signal pathway activation (e.g., by inactivating the FAR1 
gene); and/or 2) a selective growth advantage is conferred by activating the pathway (e.g., by 
transforming an auxotrophic mutant with a H1S3 gene under the control of a pheromone- 
responsive promoter, and applying selective conditions). 

It is, of course, desirable that the exogenous receptor (or other PSP surrogate) be 
exposed on a continuing basis to the peptides. Unfortunately, this is likely to result in 
desensitization of the pheromone pathway to the stimulus. Desensitization may be avoided 
by mutating (which may include deleting) the SST2 gene so that it no longer produces a 
functional protein, or by mutating other genes which may contribute to desensitization, e.g.. 
BAR 1 in the case of a cells and SVG1 for either a or a cells. 

If the endogenous pheromone receptor (or other cognate PSP) is produced by the yeast 
cell, the assay will not be able to distinguish between peptides which interact with the 
pheromone receptor (or other cognate PSP) and those which interact with the exogenous 
receptor (or other PSP surrogate). It is therefore desirable that the endogenous gene be 
deleted or otherwise rendered nonfunctional. 

The claims are hereby incorporated by reference as a further description of the 
preferred embodiments. 

■ 

BRIEF DESCRIPTION OF THE DRAW INGS 
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Figure 1. Outline of successive stages in the development of yeast autocrine systems. 
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An outline of the normal synthesis and release of mating pheromones is diagrammed in the 
upper left. Two genes, SfFal and SfFal. encode precursor proteins (MFctlp and MFa2p> 
containing four and two repeats, respectively, of the tridecapeptide representing mature a- 
factor. These precursors are processed proteolytically in a series of enzymatic reactions that 
begin with cleavage of the signal sequence in the endoplasmic reticulum and involve both 
glycosylation of the leader peptide and cleavage by the proteases KEX2p, STE13p. and 
KEX1P. The result is the secretion of mature a-factor which, upon binding to STE2p 
normally expressed on the surface of a cells, elicits a number of changes in the a cells, 
including growth arrest. The a cells, in turn, express two genes. SfFal and SfFal. which 
encode precursors (MFalp and MFa2p) for a-factor. These precursors undergo farnesylation 
by RAMI and RAM2, proteolytic trimming of the C-terminal three amino acids (by a protein 
tentatively identified as RAM3p). carboxymethylation of the newly exposed C-terminal 
cysteine by STE14p, and endoproteolytic removal of the N-terminal leader sequence by an 
activity provisionally identified as STE19p. Upon export of the mature a-factor from the cell 
via STE6p, it binds to STE3p expressed on the surface of a cells and stops their growth. 

Delete d: T 

Stage 1 involves the development of yeast strains in which SST2, FAR I, and If IS 3 are 

inactivated and a suitable reporter construct like fusl::HIS3 is integrated into the genomes of 

both a and a cells, a cells are further altered by replacement of the normally expressed 

STE3p with STE2p, while a cells are further modified by replacement of the normally 

expressed STE2p with STE3p. The resulting strains should show growth on histidine- 

deflcient media in the absence of exogenous pheromone. 
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Stage 2 involves, first, inactivation of SfFal and S1Fa2 in cells and inactivation of SI Fa l and 
SfFal in a cells developed in Stage I. These modifications will result in strains which are 
auxotrophic for histidine. Next, the appropriate expression plasmid will be introduced: the 
expression plasmid pADC-MF (see Figure 4) containing an oligonucleotide encoding ct- 
factor should confer upon a cells the ability to grow 7 on histidine-deficient media; the 
expression plasmid pADC-MFa (see Figure 6) containing an oligonucleotide encoding a- 
factor should enable a cells to grow on histidine-deficient media. 



,Stage 3 uses the cells developed in Stage 2 for the insertion of expression plasmids. 
However, instead of using plasmids which contain oligonucleotides that encode genuine 
pheromone, the yeast will be transformed with expression plasmids that contain random or 
semi-random oligonucleotides. Transformants which can grow on histidine-deficient media 
w ill be expanded and their plasmids isolated for sequencing of the inserted oligonucleotide. 

figure 2. Diagram of the plasmid used for mutagenesis of \fFal. A 1.8 kb EcoRI fragment 
containing MFal is cloned into the EcoRI site of pALTER such that single-stranded DNA 
containing the MFal minus strand can be synthesized. The diagram illustrates the different 
regions of MFal, including the promoter, transcription terminator, and different domains of 
the precursor protein: the signal peptide, the pro peptide, the four repeats of mature a-factor, 
and the three spacers which separate these repeats. Above the block diagram of the regions of 
MFal are the amino acid sequences {SEQ ID NO: 1 ) of the signal peptide and the pro peptide; 
below it are those of the pheromone repeats and the spacers (SEQ ID NO:2). The sites of 
proteolytic processing of the precursor protein are indicated by arrows, with each proteolytic 
activity represented by a different arrow, as indicated in the figure. 

Figure 3. Diagram of the plasmids used in the construction of the MFa expression cassette. 
pAAH5 contains the ADC1 promoter which will be used to drive expression of synthetic 
oligonucleotides inserted into the MFa expression cassette. The 1.5 kb BamHI to Hindlll 
fragment containing the ADC1 promoter will be cloned into pRS426. a plasmid which 
functions as a high-copy episome in yeast, to yield pRS-ADC. pRS-ADC will be the 
recipient of MFal sequences which have been mutated as follows: the region of MFal 
which encodes mature a-factor will be replaced with restriction sites that can accept 
oligonucleotides, with Afl II and Bglll ends. Insertion of oligonucleotides with Aflll and Bglll 
ends will yield a plasmid which encodes a protein containing the MFal signal and leader 
sequences upstream of the sequence encoded by the oligonucleotide. The MFal signal and 
leader sequences should direct the processing of this precursor protein through the pathway 
normally used for the secretion of mature a-factor. 
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figure 4. Diagram of constructs used for the expression of random oligonucleotides in the 
context of MFotl. Oligonucleotides containing a region of 39 random base pairs (shown at 
the top of the figure) will be cloned into the Aflll and Bglll sites of the MFal expression 
cassette. These oligonucleotides will encode the six amino acids immediately N-terminal to 
the first repeat of the cx-factor in MFal, followed in succession by a tridecapeptide of 
random sequence and a stop codon. Yeast transformed with these constructs and selected for 
an ability to grow on media deficient in uracil will use the ADC1 promoter to express a 
protein consisting of the MFal leader (both pre and pro peptides) followed by 13 random 
amino acids. Processing of the leader sequences w ill result in secretion of the tridecapeptide. 
A nucleotide sequence (SEQ ID NO:3) is presented upstream of the leader sequence and a 
second nucleotide sequence (SEQ ID NO:4) containing Aflll, Xhol and Bglll sites and coding 
for a peptide with an amino acid sequence (SEQ ID NO:5) is presented downstream from the 
leader sequence. A nucleotide sequence (SEQ ID NO:6) containing an Aflll and a Bell site 
encodes for a peptide with an amino acid sequence (SEQ ID NO:7). 



figure 5. Diagram of the plasmid used for mutagenesis of MFal. A 1 .6 kb BamHI fragment 

containing MFal is cloned into the BamHI site of pALTER such that single-stranded DNA 

containing the MFal minus strand can be synthesized. The diagram illustrates the different 

regions of MFal, including the promoter, transcription terminator, and different domains of 

the precursor protein: the leader peptide; the dodecapeptide that represents the peptide 

component of mature a-factor and whose C-terminal cysteine becomes farnesyiated and 

carboxymethylated during processing; and the C-terminal three amino acids that are removed 

during processing of the precursor. Above the block diagram of the regions of MFal is the 

amino acid sequence (SEQ ID NO:8) of the primary translation product. 
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Figure 6. Diagram of constructs used for the expression of random oligonucleotides in the 
context of MFal. Oligonucleotides containing a region of 33 random base pairs (shown at 
the top of the figure) will be cloned into the Xhol and Aflll sites of the MFal expression 
cassette. These oligonucleotides will encode the seven amino acids immediately N-terminal 
to the first amino acid of mature a-factor, followed in succession by a monodecapeptide of 
random sequence, a cysteine which is farnesyiated and carboxymethylated during processing 
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of the precursor, three amino acids (VIA) which are proteolytically removed during 
processing, and a stop codon. Yeast transformed with these constructs and selected for an 
ability to grow on media deficient in uracil will use the ADC1 promoter to express a 
precursor protein consisting of the MFal leader followed by 1 1 random amino acids and a C- 
terminal tetrapeptide CVIA. Processing of this precursor will result in secretion of a C- 
terminally farnesylated, carboxymethylated dodecapeptide which consists of 1 1 random 
amino acids and a C-terminal cysteine. A nucleotide sequence (SEQ ID NO:9) is presented 
upstream of the MFal leader and a second nucleotide sequence (SEQ ID NO: 10) containing 
an Aflll and a Xhol site and encoding for a peptide with an amino acid sequence (SEQ ID 
NO:ll) is presented downstream from the MFal leader. A nucleotide sequence (SEQ ID 
NO: 12) containing corresponding A fill and Xhol sites for cloning into SEQ ID NO: 10 
encodes for a peptide with an amino acid sequence (SEQ ID NO: 13). 

figure 7 . Autocrine Mata strain secretes and responds to signalling by a-factor. 
A synthetic oligonucleotide encoding the yeast a-factor pheromone was expressed in Mata 
cells. These cells normally express the a-factor pheromone but were prevented from doing so 
by deletion of the endogenous a-factor-encoding genes. Expression and release of a-factor by 
these cells renders them "autocrine" with regard to pheromone signalling. The peptide 
containing mature a-factor was processed within these cells for transport through the yeast 
secretory pathway to the extracellular environment. Pheromone signalling was initiated by 
the binding of a-factor to the Ste2 receptor expressed in Mata cells. Signalling by 
pheromone in the strain background used in these experiments results in growth of responsive 
cells on media that is deficient in histidine. Background growth of control cells that are not 
expressing a-factor is prevented by increasing concentrations of the HIS3 inhibitor, 
aminotriazole. 

Figure 8 . Autocrine MATa strain secretes and responds to signalling by a-factor. 
Yeast a-factor was expressed from a plasmid containing a synthetic pheromone-encoding 
oligonucleotide in Mata cells. The yeast used in these experiments were made "autocrine" by 
replacement of the normally expressed Ste2 protein with the receptor for a-factor, Ste3. The 
a-factor peptide was processed in these cells and transported to the extracellular environment 



by the endogenous Ste6 protein, an ATP-dependent transmembrane transporter. Pheromone 
signalling initiated by the a-factor released by these cells when bound to Ste3 is indicated by 
the growth of the cells on histidine-deficient media. Background growth of control cells, 
which are incapable of expressing a-factor (these a cells lack the plasmid which encodes the 
pheromone) is prevented by increasing concentrations of the HIS3 inhibitor, aminotriazole. 

figure 9 . Plasmid pYMA177 containing mutant human MDR1 (G185V mutation). 
The plasmid pYMA177 was constructed by Karl Kuchler and permits the simultaneous 
overexpression of both a mutant human Mdrl protein and the yeast a-factor pheromone 
precursor (Kuchler & Thorner. Proc. Natl. Acad. Sci. 89, 2302 ( 1992) 

Cadus 1270, containing a galactose inducible, amino-terminal myc-tagged form of the C5a 
receptor, was used to transform a protease deficient strain of yeast. 

.Figure 10 . Activity of a fusl promoter in response to signalling by human C5a expressed in 
autocrine strains of yeast. 

To verify and quantify pheromone pathway activation upon stimulation of the C5a receptor 
by C5a in yeast, the activity of the fusl promoter was determined colorometrically using a 
fusl-lacZ fusion. CY878 (MATct tbtl-1 fusl-HIS3 canl stel4::trpl ::LYS2 ste3*1156 
gpal(41)-Gcci2) was transformed with CADUS 1584 (pRS424-fusl -lacZ) in addition to 
receptor and ligand plasmids listed below. Transformants were grown overnight in synthetic 
medium lacking leucine, uracil, and tryptophan, pH 6.8, 50mM PIPES to an OD 60u of less 
than 0.8 and P-galactosidase activity (Guarente 1983) was assayed. 

jCadus 1289 + Cadus 1215 = Receptor Ligand" = (R-L-) 
Cadus 1303 + Cadus 1215 - Receptor* Ligand = (R+L-) 
Cadus 1289 + Cadus 1297 = Receptor" Ligand* = (R-L+) 
Cadus 1303 + Cadus 1297 = Receptor* Ligand* = (R+L+) 
Receptor refers to the human C5a receptor. 
Ligand refers to human C5a. 

Jigure 11 . This Figure schematically describes three hybrids of GPA1 and GaS. The - 
LLLLGAGES- sequence demarcated in GPA1 directly follows the non-conserved N-terminal 
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domain of the protein. The longer sequence demarcated in GPA1 encodes the "switch 
region" believed to be involved in the conformational change that occurs with nucleotide 
exchange upon receptor activation. 41 -Gas is comprised of the N-terminal 41 amino acids of 
GPA1 linked to Gas sequence from which the native N-terminal sequence has been deleted. 
SGS denotes a molecule comprised of the switch region residues of GPA1 replacing those of 
Gas. GPA 41 -SGS includes both the N-terminal and switch region sequences of GPA1 
inserted into Gas. (See Table 6 for the exact sequence junctions used to construct these 
hybrid proteins). 

RETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

The present invention contemplates the assaying of peptides, especially when 
presented in peptide libraries, expressed in genetically engineered yeast cells, for the ability of 
the peptides to interact with pheromone system proteins and PSP surrogates produced by 
those yeast cells. 

For the purpose of the present invention, an "exogenous" protein is one which 
sufficiently differs in amino acid sequence from the proteins naturally produced by the yeast 
cell in question so that its closest cognate is a protein produced by a cell other than a yeast 
cell. The cell producing this cognate protein may be a microbial cell (other than a yeast cell), 
a plant cell, or an animal cell. If an animal cell, it may be of invertebrate (e.g., insect or 
nematode) or of vertebrate (e.g., avian, piscine or mammalian, especially human) origin. A 
protein is considered to be of, e.g., human origin, regardless of whether it is encoded by the 
chromosome of a normal human, or by the genome of a virus which infects and replicates in 
human cells. 

An "activator'' of a pheromone system protein surrogate is a substance which, in a 
suitable yeast cell, causes the pheromone system protein surrogate to become more active, 
and thereby elevates the signal transduced by the native or modified pheromone signal 
pathway of said cell to a detectable degree. The surrogate may be initially nonfunctional, but 
rendered functional as a result of the action of the activator, or it may be functional, and the 
effect of the activator is to heighten the activity of the surrogate. The mode of action of the 
activator may be direct, e.g., through binding the surrogate, or indirect, e.g., through binding 
another molecule which otherwise interacts with the surrogate. W hen the PSP surrogate is a 
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substitute for a pheromone receptor, and the activator takes the place of the pheromone, it is 
customary to refer to the activator as an agonist of the receptor. 

Conversely, an "inhibitor" of a pheromone system protein surrogate is a substance 
which, in a suitable yeast cell, causes the PSP surrogate to become less active, and thereby 
reduces the transduced signal to a detectable degree. The reduction may be complete or 
partial. When the PSP surrogate is a substitute for a pheromone receptor, and the inhibitor 
competes with the pheromone for binding to the receptor, it is customary to refer to the 
inhibitor as an "antagonist". 

The term "modulator" includes both "activators" and "inhibitors". 

A surrogate PSP protein is "functionally homologous" to a yeast protein if. either 
alone or after being modified by a drug, it is able to perform the function of the yeast PSP, or 
an analogous function, within the engineered yeast cell. It is not necessary that it be as 
efficient as the yeast protein, however, it is desirable that it have at least 10% of at least one 
of the pheromone system-related activities of the yeast protein. Nor is it necessary that it 
have the same spectrum of action as the yeast protein, e.g., if it is a receptor, it may respond 
to entirely different ligands than does the endogenous receptor, or to some common ligands 
and some new ones. The receptors of Table 2 are considered functionally homologous with 
the yeast pheromone receptors, even though they do not respond to yeast pheromones, and 
may not couple to the unmodified endogenous G proteins, although they are G protein- 
coupled receptors. This is considered an "analogous function". 

The PSP surrogate may be a protein which must be modified in some way by a drug to 
be functional. For example, the drug could cause an allosteric change in the PSP surrogate's 
conformation, or it could cleave off a portion of the surrogate, the balance of the protein then 
being a functional molecule. 

The PSP surrogate may also be one which is functional only if other modifications are 
made in the yeast cell, e.g., expression of a chimeric G a subunit to interact with an 
exogenous G protein-coupled receptor. 

The term "substantially homologous", when used in connection with amino acid 
sequences, refers to sequences which are substantially identical to or similar in sequence, 
giving rise to a homology in conformation and thus to similar biological activity. The term is 
not intended to imply a common evolution of the sequences. 
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Typically, "substantially homologous" sequences are at least 50%. more preferably at 
least 80°b, identical in sequence, at least over any regions known to be involved in the desired 
activity. Most preferably, no more than five residues, other than at the termini, are different. 
Preferably, the divergence in sequence, at least in the aforementioned regions, is in the form 
of "conservative modifications". 

"Conservative modifications" are defined as 

(a) conservative substitutions of amino acids as hereafter defined; and 

(b) single or multiple insertions or deletions of amino acids at the termini, 
at interdomain boundaries, in loops or in other segments of relatively high 
mobility. 

Preferably, except at the termini, no more than about five amino acids are 
inserted or deleted at a particular locus, and the modifications are outside 
regions known to contain binding sites important to activity . 

Conservative substitutions are herein defined as exchanges within one of the 
following five groups: 

I. Small aliphatic, nonpolar or slightly polar residues: 
Ala, Ser, Thr (Pro, Gly) 

II. Polar, negatively charged residues: and their amides 
Asp, Asn, Glu, Gin 

III. Polar, positively charged residues: 
His, Arg, Lys 

IV. Large, aliphatic, nonpolar residues: 
Met, Leu, He, Val (Cys) 

V. Large, aromatic residues: 
Phe, Tyr, Trp 

Residues Pro, Gly and Cys are parenthesized because they can have special 
conformational roles. Cys participates in formation of disulfide bonds. Gly imparts 
flexibility to the chain. Pro imparts rigidity to the chain and disrupts a helices. These 
residues may be essential in certain regions of the polypeptide, but substitutable elsew here. 
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Two regulator. DNA sequences (e.g., promoters) are "substantially homologous" if 
they have substantially the same regulatory effect as a result of a substantial identity in 
nucleotide sequence. Typically, "substantially homologous" sequences are at least 50%. more 
preferably at least 80%, identical, at least in regions known to be involved in the desired 
regulation. Most preferably, no more than five bases are different. 

For the purposes of the appended claims, the term "chimeric protein" refers to a 
protein which is not identical in sequence to either of two patental proteins A and B, but 
which, when its sequence is aligned with the sequences of A and B, can be seen to borrow 
features (identically or conservatively) from both parental proteins. 

The term "autocrine cell", as used herein, refers to a cell which produces a substance 
which can stimulate the pheromone system pathway of that cell. Wild-ty pe a and a cells are 
not autocrine. However a yeast cell which produces both a-factor and a-factor receptor, or 
both a-factor and a-factor receptor, in functional form, is autocrine. By extension, yeast cells 
which produce a peptide which is being screened for the ability to activate the pheromone 
system pathway (e.g., by activating a G protein-coupled receptor) are called "autocrine cells", 
though it might be more precise to call them "putative autocrine cells". Of course, in a library 
of such cells, in which a multitude of different peptides are produced, it is likely that one or 
more of the cells will be "autocrine" in the stricter sense of the term. 
Farnesyltransferases 

The activity of yeast a-factor requires its farnesylation (mediated by protein 
farnesyltransferase, comprised of Rami p and Ram2p), proteolytic cleavage of the C-terminal 
3 amino acids of the primary translation product (mediated by an as yet unidentified enzyme), 
and carboxymethylation of the C-terminal cysteine (mediated by Stel4p). The yeast and 
mammalian farnesyltransferases are structurally and functionally similar (Gomez R et al., 
Biochem. J. 289:25-31, 1993; Kohl NE et al., J. Biol. Chem. 266:18884-8, 1991). Sequence 
homologies exist between the genes encoding the a and P subunits of the yeast 
farnesyltransferase (H4\f2 and H4\fl, respectively) and the genes encoding the a and p 
subunits of the mammalian farnesytransferase (Kohl NE et al., J. Biol. Chem. 266:18884-8, 
1991; Chen WJ et al.. Cell 66:327-34, 1991). It has been observed that the P subunit of 
mammalian farnesytransferase and Ramlp are 37% identical in amino acid sequence (Chen 
WJ et al., Cell 66:327-34. 1991 ). 
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The importance of a screen for inhibitors of farnes> [-transferase is suggested by the 
facts that mammalian p21ras, a preeminent regulator of the growth and differentiation of 
mammalian cells that is involved in a variety of cancers, is a substrate for the 
farnesyltransferase and that farnesylation of p21ras is required for its activity. In fact, a 
synthetic organic inhibitor of farnesyl protein transferase has been shown to selectively 
inhibit r<xs-dependent cell transformation (Kohl et al.. Science 260, 1934 (1993). Of the two 
subunits of farnesyltransferase, the p subunit is a more attractive target for inhibitors, since it 
is apparently dedicated to farnesyla-tion. The ct subunit, in contrast, is shared by geranyl- 
geranyltransferase I, an enzyme involved in the modification of the Gy subunits of 
heterotrimeric G proteins and small molecular weight G proteins of the Rho/Rac family. 
While the p subunit is dedicated to farnesylation, the mammalian farnesyltransferase has a 
variety of substrates in addition to p21ras. The effect of inhibitors of the p subunit on the 
farnesylation of these other substrates, e.g., lamin proteins, transducin-y and rhodopsin 
kinase, will be considered in the design and use of potential farnesyltransferase inhibitors. 

It has not yet been demonstrated that the homologous mammalian gene will 
functionally substitute for yeast Ramlp, however, this can be formally tested using ram J 
mutants and a vector expressing the mammalian gene encoding the p subunit of the 
farnesyltransferase. If the mammalian p subunit can function in place of Ramlp, test cells 
will be both viable (as a result of farnesylation of Ras) and competent for mating (as a result 
of farnesylation of a-factor). 

If the mammalian gene encoding the p subunit of farnesyl-transferase complements 
rami, yeast would provide a test system for the discovery of potential inhibitors of 
mammalian farnesyl-transferase. Specifically, MATa yeast tester cells could be exploited 
that: 1. carry the gene for the p subunit of mammalian farnesyltransferase in lieu of RAMI; 
2. carry the cam mutation that renders the strains resistant to loss of Ras function in the 
presence of cAMP; 3. respond to a-factor which they export by virtue of heterologous 
expression of Ste3p; 4. respond to autocrine a-factor such that they cannot grow on media 
containing galactose. The latter characteristic will require expression of GAL1 under the 
control of a pheromone-responsive promoter and cells engineered to contain mutated GAL7 
or GAL 10 genes. Expression of GAL1 is toxic in the presence of galactose in strains which 
contain mutations in either the GAL7 or GAL 10 genes. Signaling through the pheromone 
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response pathway would render cells so engineered galactose-sensitive. Exposure of such 
strains to compounds which inhibit the P subunit of farnes> 1-transferase will confer upon 
these cells the ability to grow on media containing galactose and cAMP. 

If the mammalian gene encoding the p subunit of farnesyltransferase (and all modified 
versions of the gene) fails to complement rami, we may use the wild-type Ramlp as a 
surrogate target for potential effectors of mammalian farnesyltransferase. Specifically, we 
will use as tester cells MATa yeast strains that: 1. carry the cam mutation that renders the 
strains resistent to loss of RAS function in the presence of cAMP; 2. respond to a-factor 
which they export by virtue of heterologous expression of Ste3p; 3. respond to autocrine a- 
factor such that they cannot grow on media containing galactose. Exposure of such strains to 
compounds which inhibit the p subunit of farnesy-transferase will confer upon these cells the 
ability to grow on media containing galactose and cAMP. 

In the strategies outlined above, it is desirable to dis-criminate inhibitors of 
farnesytransferase from compounds that either directly block the negative response to a- 
factor, e.g. by interfering with the interaction of the Ste4-Stel8 complex with its effector, or 
by blocking the production of a-factor by a mechanism that does not involve 
farnesyltransferase. Controls would identify such false positives. Candidate agents will be 
tested on a MATa strain that is engineered to secrete a-factor and to respond to the secreted 
a-factor by failing to grow on galactose-containing media, as in the negative selection scheme 
outlined above. The strain will express wild type Ramlp. Any agent that enables these cells 
to grow on media containing galactose and cAMP will not be acting as an inhibitor of 
farnesyltransferase. 

Candidate compounds which pass the foregoing test may act by targeting Stel4p, 
Ste6p, or other proteins involved in the maturation and export of a-factor, rather than 
farnesyl-transferase. (Note, however, that compounds that inhibit processes critical to cell 
survival will not give rise to false positives. For example, since the protease responsible for 
the endoproteolytic removal of the C-terminal tripeptide of the a-factor precursor likely 
participates in the processing of Gg and members of the Rho Rac family of proteins, 
inhibitors of this enzyme may not permit growth of the tester cells). Of the proteins involved 
in the production of a-factor, only the farnesyltransferase is also a major determinant of RAS 
function. Due to this effect, rami mutants are defective for growth at 30°C and completely 
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unable to grow at 37 (He B et al., Proc Natl Acad Sci 88:11373-7, 1991). Tester cells 
(described above) can be grown in the presence of a candidate inhibitor on galactose- 
containing media ± cAMP. If the test compound inhibits farnesyltransferase, cells will be 
capable of growth on galactose + cAMP but not on galactose in the absence of cAMP. This 
difference may be most obvious at 37°. If, on the other hand, the test compound inhibits 
other proteins involved in a-factor production, cells will grow on galactose-containing media 
regardless of the presence or absence of cAMP. 

Compounds which pass the above tests are likely inhibitors of farnesyltransferase. 
This can be confirmed and their potencies determined with direct in vitro enzyme assays. 
Note that the strategies outlined will identify farnesyltransferase inhibitors which affect 
Rami p. Agents which block Ram2p would likely fail to grow under all conditions. Indeed, 
ram2 null mutations are lethal (He B et al., Proc Natl Acad Sci 88:1 1373-7, 1991), perhaps 
due to the fact that Ram2p also functions as a component of geranylgerany ltransferase I. 

Carboxymethyltransferases 

In yeast, methylation of the C-terminal amino acid of a-factor. Ras proteins, and 
presumably Rho/Rac proteins is catalyzed by Stel4p. Although MATa stel4 mutants are 
unable to mate, reflecting the requirement of carboxymethylation for the activity of a-factor, 
steI4 disruptions are not lethal and do not affect the rate of cell proliferation. 
Carboxymethy lation appears to be dispensible for the function of Ras proteins and Stel8p 
(the yeast homologue of the Gy subunit). Although Ras function in yeast can apparently 
tolerate the absence of carboxymethyl modification, it is nonetheless possible that inhibitors 
of mammalian methyltransferases could alter the activity of mammalian p21ras. 

It could be determined if yeast sie!4 mutations can be complemented by the 
homologous mammalian gene, or a modified version of it. One would use an episomal vector 
to express the mammalian gene encoding the methyitransferase in yeast that are genotypically 
stel4. The strain would be a modified MATa strain that expresses the a-factor receptor in 
lieu of the normal a-factor receptor and that contains an integrated fusl-HIS3 construct, so 
that the a-factor secreted by the cell confers autocrine growth on histidine-deficient media. If 
the mammalian methyitransferase can function in place of Stel4p, the tester cells will be 
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capable of mating. That is, the mammalian methyltransferase will permit synthesis of active 
a-factor in ste!4 mutants. 

If the mammalian gene encoding the methvltransferase will complement stel-t, tester 
strains can be constructed to test for potential inhibitors of mammalian methvltransferase. In 
one embodiment, tester MATa yeast strains will: 1. carry a mammalian 
carboxymethyltransferase gene in lieu of STE14; 2. respond to a-factor which they export by 
virtue of heterologous expression of Ste3p; 3. respond to autocrine a-factor such that they 
cannot grow on media containing galactose as in the negative GAL1 selection scheme 
outlined above. Exposure of such strains to compounds which inhibit the methyltransferase 
will confer upon these cells the ability to grow on media containing galactose. 

It is desirable to discriminate inhibitors of carboxy-methyltransferase activity from 
compounds that either directly block the negative response to a-factor, e.g. by interfering with 
the interaction of the Ste4-Stel8 complex with its effector, or block the production of a-factor 
by a mechanism that does not involve methyltransferase. The following control experiments 
will identify such false positives. Candidate inhibitors will be tested on a MATa strain that is 
engineered to secrete a-factor and to respond to the secreted a-factor by failing to grow on 
galactose-containing media. Any agent that enables these cells to grow on media containing 
galactose will be not be acting as an inhibitor of carboxymethyltransferase. Candidate 
compounds which pass the foregoing test may be targetting the carboxy-methyltransferase, 
farnesyltransferase, Ste6p, or other proteins involved in the maturation and export of a-factor. 
In order to discriminate the target of the compounds, a combination of in vitro biochemical 
and in vivo genetic assays can be applied: both the carboxymethyltransferase and the 
farnesyltransferase can be assayed in vitro to test the effect of the candidate agent. 
Furthermore, if the target is Stel4p its overexpression on high-copy plasmids should confer 
resistance to the effect of the compound in vivo. 

Proteases 

Mature yeast a-factor is a thirteen amino acid peptide that is derived from a 
polyprotein precursor in much the same manner as mature mammalian melanocyte- 
stimulating hormone (MSH) or calcitonin are derived from precursor polyproteins. Two 
genes in the yeast genome encode prepro-a-factor, MFal and MFa2. MFal encodes a 



29 



precursor polypeptide containing four copies of mature a-factor embedded in a polypeptide of 
the following structure: hydrophobic pre-sequence hydrophilic pro-sequence a-factor De- 
factor a-factor a-factor. MFa2 encodes a polyprotein precursor of a similar structure 
containing only two copies of mature a-factor. 

Pre-pro-a-factor is synthesized in the cytoplasm and is then transported from the 
c\loplasm to the endoplasmic reticulum and then to the Golgi along the classical secretory 
pathway of 5 cerevisiae. The signal sequence of prepro-a-factor is cleaved during transit 
into the ER by signal peptidase and asparagine-linked oligosaccharides are added (in the ER) 
and modified (in the Golgi) on the pro-segment of the precursor as it transits the secretory 
pathway. Once in the Golgi, three distinct proteolytic processing events occur. First, the 
Kex2 protease cleaves at dibasic residues (-KR-) near the amino terminus of each a-factor 
repeat. Kex2 is homologous to the subtilisin-like endoproteases PC2 and PC1/PC3 involved 
in prohormone processing in mammalian cells (Smeekens and Steiner 1990; Nakayama et al. 
1991). Additional mammalian Kex2-like processing endoproteases include PACE, isolated 
from a human hepatoma, PC4, expressed in testicular germ cells and PC6, a candidate 
protease for the processing of gastrointestinal peptides (Barr et al. 1991; Nakayama et al. 
1992; Nakagawa et al. 1993). It appears that Kex2-like proteins comprise a large family of 
tissue-specific endoproteases in mammalian cells. 

Once Kex2 has released the immature a-factor peptides, two additional proteases act 
to complete processing. Kexl is a specific carboxypeptidase that removes the carboxy- 
terminal-KR remaining after cleavage by Kex2. Like its mammalian counterparts 
carboxypeptidases B and E, Kexl is highly specific for peptide substrates with carboxy- 
terminal basic residues. The final proteolytic processing event that occurs is the removal of 
the spacer dipeptides at the amino terminus of each pro-a-factor peptide. This is 
accomplished by the product of the STE13 gene, dipeptidyl aminopeptidase A. This enzyme 
is a type IV dipeptidyl aminopeptidase: it is capable of cleaving on the carboxyl side of either 
-x-A- or -x-P- sites in vitro. 

Other type IV dipeptidyl aminopeptidases are believed to be active in the processing 
of a variety of pre-peptides in animal cells (Kreil 1 990). In addition, functional similarity has 
been proved between yeast Kexl and Kex2 and their mammalian counter-parts in that both 
yeast enzymes will proteolytically cleave endogenous precursors when expressed in 
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mammalian cells deficient in the native enzyme (Thomas et al. 1988, 1990). It appears likely, 
then, that mammalian homologs of the yeast proteases Kexl, Kex2 and Stel3p, when 
expressed in yeast, will function to process a synthetic a-factor pheromone precursor bearing 
appropriate cleavage sites. Human proteases that may so function in yeast include PC2 and 
PCI PC3 (or other Kex2 homologs), carboxypeptidases B and E (Kexl homologs) and type 
IV dipeptidyl aminopeptidases (Stel3p homologs). 

Yeast would provide a facile assay system for the discovery of inhibitors of proteases 
able to process synthetic a-factor. The yeast could be engineered to express both the 
potential inhibitor and the exogenous protease, and, preferably, not the latter's yeast cognate. 

Furthermore, this means of exploiting yeast pheromone processing to identify protease 
inhibitors can be expanded to encompass any protease that can be expressed to function in 
yeast, provided an appropriate cleavage recognition site is included in a synthetic a-factor 
precursor. In the latter approach, novel proteolytic activities will be added to yeast; these 
enzymes will substitute for proteases in the a-factor maturation pathway but will not be 
"catalytic homologues" of Kexl, Kex2 or Stel3p. Production of mature a-factor will become 
dependent on the activity of the novel protease through removal of the recognition site(s) for 
a selected yeast enzyme from a synthetic MFa gene and insertion of the recognition sequence 
for the novel protease(s). 

Enzymes for which inhibitors could be identified via this strategy include, by way of 
example, HIV-1 protease or other virally encoded proteases involved in the maturation of 
infectious particles; neutrophil elastase believed to be involved in pulmonary disease and 
inflammatory bowel disease; Factor Xa involved in thrombin processing and clotting 
disorders; and CD26, a dipeptidyl peptidase IV and putatively the second receptor for HIV-1 
on CD4* cells. In addition, metalloproteinases (e.g. collagenase) and serine proteases 
involved in tissue invasion by tumor cells and in metastasis would be suitable therapeutic 
targets. In support of this, it has been demonstrated that administration of tissue-derived 
inhibitors of metalloproteinases results in decreased metastasis in animal models (Schultz et 
al. 1988). Collagenases have also been implicated in the destruction of connective tissue 
which accompanies inflammatory processes like rheumatoid arthritis. 

The use of the present invention to screen for modulators of prohormone convertase 
PCI is described in Example 8. 
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.Exogenous ABC Transporters 

The majority of proteins destined for transport to the extracellular environment 
proceed through a secretory pathway that includes translation initiation in the cytoplasm, 
transport to the lumen of the endoplasmic reticulum, passage through the Golgi to secretory 
vesicles and subsequent exit from cells. Other proteins leave the cell by an alternative 
mechanism, which involves mediation by an "ABC transporter". The ABC transporters form 
a family of evolutionary conserved proteins, share a similar overall structure, and function in 
the transport of large and small molecules across cellular membranes (Higgins 1992). The 
characteristic component of members of this protein family is a highly conserved sequence 
that binds ATP (Higgins et al., 1986; Hyde et al. 1990); these intrinsic membrane proteins are 
ATPases, deriving energy from the hydrolysis of that nucleotide to effect the transport of 
molecules. This family includes over 50 prokaryotic and eukaryotic proteins: transporters of 
amino acids, sugars, oligosaccharides, ions, heavy metals, peptides, or other proteins belong 
to this superfamily. Representative transmembrane transporters are included in Table 1. 
Typically, ABC transporters use the energy of ATP hydrolysis to pump substrate across a cell 
membrane against a concentration gradient. Some import substrate, others export it. See 
Higgins, Ann. Rev. Cell, Biol., 8:67-113 (1992). 

The prototypical structure of an ABC transporter includes four membrane-associated 
domains: two hydrophobic, putative membrane-spanning sequences, each predicted to 
traverse the membrane six times, and two nucleotide binding domains that couple ATP 
hydrolysis to transport. In prokaryotes, the domains of an ABC transporter are often present 
on separate polypeptides. Various permutations of domain fusions have been described: the 
E. coli iron hydroxamate transporter contains the two membrane-spanning domains in a 
single polypeptide and the ribose trans-porter of the same organism bears two nucleotide- 
binding domains on one molecule. The major histocompatibility complex (MHC) peptide 
transporter is composed of two polypeptides, Tapl and Tap2. The N-terminus of each protein 
contains a hydrophobic membrane-spanning domain while the C-terminus contains an ATP- 
binding sequence. Together Tapl and Tap2 form a functional complex. The heavy metal 
tolerance protein, HMTI, expressed in the fission yeast Schizosaccharomyces pombt\ consists 
of a polypeptide containing a single hydrophobic domain and a C-terminal ATP-binding 
sequence (Ortiz et al. 1992). It may be that the HMT1 transporter functions as a homodimer. 
The Saccharomyces cere vis iae Ste6 a- factor transporter is expressed as a single polypeptide 



32 



containing two membrane-spanning domains and two nucleotide-binding domains. When 
Ste6 is expressed as two half-molecules, the protein complex which apparently forms retains 
function at a level greater than 50% that of the w ild type, single polypeptide (Berkower and 
Michaels 1991). In other eukaryotic ABC transporters, including Mdrl, CFTR and MRP, the 
four domains are also contained within a single polypeptide. Thus, the ABC transporter may 
be a single multidomain polypeptide, or it may comprise two or more polypeptides, each 
providing one or more domains. 

In general, transporters contain six transmembrane segments per each hydrophobic 
domain, for a total of twelve segments. The minimum number of transmembrane segments 
required for formation of a translocation complex appears to be 10. Thus the histidine 
transporter of .V typhimuhum lacks an N-terminal transmembrane segment from each of its 
hydrophophic domains and therefore contains five transmembrane segments per domain 
(Higgins et al.. Nature 298, 723-727 (1982). The MalF protein of the E. coli maltose 
transporter contains an N-terminal extension of hydro-phobic sequence which bears two 
additional transmembrane segments, bringing the total for this hydrophobic domain to 8 
(Overduin et al. 1988). The N-terminal extension can be deleted, however, without loss of 
function of this transporter (Ehrmann et al. 1990). Although the number of segments required 
for formation of a functional translocator is suggested by these studies, there exists no data on 
the precise structure of the transmembrane segments themselves. These sequences are 
assumed to have an a-helical form, but this has not been proven and the structure of the entire 
translocation complex within the plasma membrane remains to be elucidated. 

In order to span the lipid bilayer, a minimum of 20 amino acids is required and 
sequences believed to form transmembrane segments have been identified using 
hydrophobicity scales. Hydrophobicity scales assign values to individual amino acid residues 
indicating the degree of hydrophobicity of each molecule (Kyte and Doolittle 1982; Engleman 
et al. 1986). These values are based on experimental data (solubility measurements of amino 
acids in various solvents, analysis of side chains within soluble proteins) and theoretical 
considerations, and allow prediction of secondary structure in novel sequence with reasonable 
accuracy. Analysis using hydrophobicity measurements indicates those stretches of a protein 
sequence which have the hydrophobic properties consistent with a transmembrane helix. 

With a few exceptions, there is little or no significant amino acid sequence similarity 
between the transmembrane domains of two different transporters. This lack of sequence 
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similarity is not inconsistent with the apparent function of these hydrophobic domains. While 
these residues must be capable of forming the hydrophobic ot-helical structures believed to 
transverse the plasma membrane, many amino acid residues are hydrophobic and can 
contribute to the formation of an a-helix. Considerable, if as yet inexplicable, sequence 
similarity has been detected in comparisons of the transmembrane domains of the yeast STE6, 
human MDR and E. coli HlyB hemolysin transporters [Gros et al.. Cell 47, 371 (1986); 
McGrath and Varchavsky, Nature 340, 400 (1989); Kuchler et al., EM BO J. 8, 3973 (1989)]. 
Other sequence similarities can be explained by gene duplication, as in the case of the 
transmembrane domains of rodent P-glyco-proteins (Endicott et al. 1991). The 
transmembrane domain of the histidine transporter of S. typhimurium bears homology to that 
of the octopine uptake system of Agrobacterium tumefaciens, the latter two transporters 
translocate chemically similar substrates (Valdiva et al, 1991 ). 

Study of mutant transport proteins has pointed to a role for the transmembrane 
sequences in the recognition of substrate. Thus maltose transporters in E. colt which gain the 
ability to translocate p-nitrophenyl-a-maltoside bear mutations in the transmembrane domain 
(Reyes et al. 1986). A mutation in transmembrane segment 1 1 of MDR has been shown to 
change the substrate specificity of that transporter (Gros et al. 1991 ) and mutation of charged 
residues in the transmembrane domain of CFTR changes its ion selectivity (Anderson et al. 
1991). 

Some aspects of the involvement of extramembrane loop sequences in transport 
function are being elucidated. In a number of bacterial transporters a short conserved motif is 
present on the cytoplasmic loop which connects transmembrane segments 4 and 5 [Dassa and 
Hofnung (1985)]. It has been hypothesized that this sequence interacts with the ATP-binding 
domains of these transport proteins; mutation of this conserved sequence will abolish 
transport function (Dassa 1990). Cytoplasmic loops may also be involved in substrate 
recognition. Thus the sequences follow ing transmembrane segments 7 and 12 of the yeast a- 
factor transporter resemble sequences in the a-factor receptor, Ste3p, and may interact with 
the pheromone substrate (Kuchler et al. 1989). In fact, mutations in the cytoplasmic loops are 
known to alter the substrate specificity of a given transporter. The G185V mutation of human 
MDR, located in the loop between transmembrane segments 2 and 3, alters the interaction of 
that transporter with vinblastine and colchicine (Choi et al. 1988). 
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The ATP-binding domains are about 200 amino acids long, and domains from 
different transporters typically have a sequence identity of 30-50%. The conserved sequences 
include the "Walker motifs" which are associated with many nucleotide binding proteins. 
Walker, et a!., ESfBOJ. 1:945-951 (1982). 

Sequence conserv ation extends over the length of the ATP-binding domain, not being limited 
to the Walker motifs. Furthermore, the ATP-binding domains of a single transporter exhibit 
greater sequence identity to one another than to the domains from two different transporters. 
Not all proteins containing a conserved ATP-binding domain are involved in transport, 
however. The cytoplasmic enzyme UvrA functions in DNA repair and the EF-3 protein of 
yeast is an elongation factor. Yet both proteins contain ATP-binding cassettes identifiable by 
sequence comparison. 

ATP-binding domains are highly hydrophilic and, in the case of transporters, appear to 
reside at the cytoplasmic face of the membrane, anchored there via an association with the 
membrane-spanning domain of these proteins. The points of interaction between the 
transmembrane and ATP-binding domains have not been experimentally determined. Models 
of the structure of the nucleotide binding domain indicate that loop sequences may extend 
from the core of the structure to interface with the hydrophilic sequences which transverse the 
membrane (Hyde et al. 1990; Mimura et al. 1991). The two structural models, one based on 
adenylate cyclase and the other on ras p21 structure, predict a core nucleotide binding fold 
composed of five p-sheets with the Walker A motif (a glycine-rich loop) positioned to 
interact with ATP during hydrolysis. In addition, loop structures (two loops in one model, 
one large loop in the other) are predicted to extend from the core to couple the ATP-binding 
domain to other domains of the transporter. The coupling sequences transmit, most likely 
through conformational change, the energy of ATP hydrolysis to those portions of the 
molecule which are involved in transport. 

t ,Ste6 function is required for mating but the protein is not necessary for yeast survival ^SllSSSf 

Deleted: 

(Wilson and Herskowiz 1984; Kuchler et al. 1989; McGrath and Varshavsky 1989). Ste6 is ~~ 
structurally homologous to the mammalian MDRs. Furthermore, it has been demonstrated 
that two mammalian MDR proteins, murine Mdr3 and human Mdrl, will substitute 
functionally for the yeast transporter in cells deleted for STE6 (Raymond et al. 1992; Kuchler 
and Thorner 1992). Yeast strains deleted for STE6 serve as a starting point for the design of 
screens to discover compounds that modulate the function of exogenous ABC transporters. 
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Two different yeast screens can be used to identify modulators of ABC transporter 
function. In the first instance, a mammalian protein that transports a-factor will serve as a 
target for potential inhibitors of transporter function. Thus, a yeast strain will be engineered 
to express a functional transporter, e.g. mammalian MDR1, which substitutes for the yeast 
Ste6 protein in the transport of a-factor. Furthermore, this strain will be engineered to 
respond in autocrine fashion to a-factor: e.g., so that the cells will be unable to grow on media 
containing galactose. This negative selection will depend on the expression of the GAL1 
gene under the control of a pheromone-responsive promoter in a strain background which 
includes mutated versions of the GAL7 or GAL 10 genes. Expression of GAL1 in the 
presence of galactose in such a strain background is toxic to cells. In the absence of a-factor 
transport, signaling down the pheromone response pathway would cease as would the 
consequent expression of the toxic gene. Cell growth in the presence of a test compound, or 
upon expression of a specific random peptide, would signal inhibition of transport function 
and the identification of a potential therapeutic. 

In addition to inhibitors of MDR, compounds may be identified which interfere with 
the interaction of a-factor with the a-factor receptor. Such compounds can be discriminated 
by their inhibition of a-factor-induced growth arrest in a wild type Mata strain. Compounds 
may also impact at other points along the pheromone response pathway to inhibit signaling 
and these compounds will prevent signal transduction in a wild type Mata strain. 

In a second screen, a mutant heterologous transporter (e.g., mutant CFTR) that is 
initially incapable of transporting a-factor or an a-factor-Iike peptide can be expressed in 
autocrine yeast deleted for endogenous Ste6. The cells will be capable of an autocrine 
response to the a-factor which those cells produce. Thus a pheromone-responsive promoter 
will control expression of a gene that confers an ability to grow in selective media. Such cells 
w ill permit identification of compounds w hich correct defects in the transporter and permit it 
to function in the export of pheromone analogues to the extracellular space. In this way, 
therapeutic peptides or other classes of chemical compounds could be identified which 
stabilize a mutant protein and allow normal processing, transport, localization to the plasma 
membrane and function. This strategy, if successful, may eliminate the need to "replace" 
some mutant genes with normal sequence, as envisioned in gene therapies, by recovering the 
function of mutant proteins through the correction of processing and/or localization defects. 
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In addition to "activators" of the mutant transporter, compounds may also be 
identified which are capable of initiating signalling from the a-factor receptor in the absence 
of transport by the endogenously expressed pheromone. These compounds will be 
distinguished by their ability to cause growth arrest in a wild type Mata strain. Compounds 
may also impact at other points along the pheromone pathway and can be discerned via an 
ability to initiate signalling in a w ild type Mata strain in the absence of a-factor. 

In a preferred embodiment, the exogenous protein produced by the yeast cells is one 
of the exogenous ABC transporters listed in Table 1 . 

Exogenous G protein coupled receptors 

In some instances, for a drug to cure a disease or alleviate its symptoms, the drug must 
be delivered to the appropriate cells, and trigger the proper "switches/ 1 The cellular switches 
are known as "receptors." Hormones, growth factors, neurotransmitters and many other 
biomolecules normally act through interaction with specific cellular receptors. Drugs may 
activate or block particular receptors to achieve a desired pharmaceutical effect. Cell 
surface receptors mediate the transduction of an "external" signal (the binding of a ligand to 
the receptor) into an "internal" signal (the modulation of a cytoplasmic metabolic pathway). 

In many cases, transduction is accomplished by the following signaling cascade: 

• An agonist (the ligand) binds to a specific protein (the receptor) on 
the cell surface. 

• As a result of the ligand binding, the receptor undergoes an 
allosteric change which activates a transducing protein in the cell membrane. 

• The transducing protein activates, within the cell, production of so- 
called "second messenger molecules." 

• The second messenger molecules activate certain regulatory proteins 
within the cell that have the potential to "switch on" or "off specific genes or alter some 
metabolic process. 

This series of events is coupled in a specific fashion for each possible cellular 
response. The response to a specific ligand may depend upon which receptor a cell expresses. 



37 



For instance, the response to adrenalin in cells expressing cx-adrenergic receptors may be the 
opposite of the response in cells expressing P-adrenergic receptors. 

The above "cascade" is idealized, and variations on this theme occur. For example, a 
receptor may act as its own transducing protein, or a transducing protein may act directly on 
an intracellular target without mediation by a "second messenger". 

One family of signal transduction cascades found in eukaryotic cells utilizes 
heterotrimeric "G proteins." Many different G proteins are known to interact with receptors. 
G protein signaling systems include three components: the receptor itself, a GTP-binding 
protein (G protein), and an intracellular target protein. 

The cell membrane acts as a switchboard. Messages arriving through different 
receptors can produce a single effect if the receptors act on the same ty pe of G protein. On 
the other hand, signals activating a single receptor can produce more than one effect if the 
receptor acts on different kinds of G proteins, or if the G proteins can act on different 
effectors. 

In their resting state, the G proteins, which consist of alpha (a), beta (p) and gamma (y 
) subunits, are complexed with the nucleotide guanosine diphosphate (GDP) and are in 
contact with receptors. When a hormone or other first messenger binds to receptor, the 
receptor changes conformation and this alters its interaction with the G protein. This spurs 
the a subunit to release GDP, and the more abundant nucleotide guanosine tri-phosphate 
(GTP), replaces it, activating the G protein. The G protein then dissociates to separate the a 
subunit from the still complexed beta and gamma subunits. Either the Got subunit, or the Gpy 
complex, depending on the pathway, interacts with an effector. The effector (which is often 
an enzyme) in turn converts an inactive precursor molecule into an active "second 
messenger," which may diffuse through the cytoplasm, triggering a metabolic cascade. After 
a few seconds, the Get converts the GTP to GDP, thereby inactivating itself. The inactivated 
Get may then reassociate with the GPy complex. 

Hundreds, if not thousands, of receptors convey messages through heterotrimeric G 
proteins, of which at least 17 distinct forms have been isolated. Although the greatest 
variability has been seen in the a subunit, several different p and y structures have been 
reported. There are, additionally, several different G protein-dependent effectors. 



Most G protein-coupled receptors are comprised of a single protein chain that is 
threaded through the plasma membrane sev en times. Such receptors are often referred to as 
seven-transmembrane receptors (STRs). More than a hundred different STRs have been 
found, including many distinct receptors that bind the same ligand, and there are likely many 
more STRs awaiting discovery . 

In addition, STRs have been identified for which the natural ligands are unknown; 
these receptors are termed "orphan" G protein-coupled receptors. Examples include receptors 
cloned by Neote et al. Cell 72, 415 (1993); Kouba et al. FEBS Lett. 321, 173 (1993); 
Birkenbach et al. J. Virol. 67, 2209 (1993). 

The "exogenous G protein-coupled receptors" of the present invention may be any G 
protein-coupled receptor which is exogenous to the wild-type yeast cell which is to be 
genetically engineered for the purpose of the present inv ention. This receptor may be a plant 
or animal cell receptor. Screening for binding to plant cell receptors may be useful in the 
development of, e.g., herbicides. In the case of an animal receptor, it may be of invertebrate 
or vertebrate origin. If an invertebrate receptor, an insect receptor is preferred, and would 
facilitate development of insecticides. The receptor may also be a vertebrate, more preferably 
a mammalian, still more preferably a human, receptor. The exogenous receptor is also 
preferably a seven transmembrane segment receptor. 

Suitable receptors include, but are not limited to, dopaminergic, muscarinic 
cholinergic, a-adrenergic, (3-adrenergic, opioid (including delta and mu), cannabinoid, 
serotoninergic, and GABAergic receptors. Other suitable receptors are listed in Table 2. The 
term "receptor," as used herein, encompasses both naturally occurring and mutant receptors. 

Many of these G protein-coupled receptors, like the yeast a- and ct-factor receptors, 
contain seven hydrophobic amino acid-rich regions which are assumed to lie within the 
plasma membrane. Specific human G protein-coupled STRs for which genes have been 
isolated and for which expression vectors could be constructed include those listed in Table 2. 
Thus, the gene would be operably linked to a promoter functional in yeast and to a signal 
sequence functional in yeast. Suitable promoters include Ste2, Ste3 and gal 10 . Suitable 
signal sequences include those of Ste2 , Ste3 and of other genes which encode proteins 
secreted by yeast cells. Preferably, the codons of the gene w ould be optimized for expression 



35 

in yeast. See Hoekema et al., Mol. Cell. BioL 7:2914-24 (1987); Sharp, et al., 14:5125-43 
(1986). 

The homology of STRs is discussed in Dohlman et al.. Ann. Rev. Biochem., 60:653- 
88 (1991). When STRs are compared, a distinct spatial pattern of homology is discernable. 
The transmembrane domains are often the most similar, whereas the N- and C-terminal 
regions, and the cytoplasmic loop connecting transmembrane segments V and VI are more 
divergent. 

The functional significance of different STR regions has been studied by introducing 
point mutations (both substitutions and deletions) and by constructing chimeras of different 
but related STRs. Synthetic peptides corresponding to individual segments have also been 
tested for activity. Affinity labeling has been used to identify ligand binding sites. 

It is conceivable that a foreign receptor which is expressed in yeast will functionally 
integrate into the yeast membrane, and there interact with the endogenous yeast G protein. 
More likely, either the receptor will need to be modified (e.g., by replacing its V-VI loop with 
that of the yeast STE2 or STE3 receptor), or a compatible G protein should be provided. 

If the wild-type exogenous G protein-coupled receptor cannot be made functional in 
yeast, it may be mutated for this purpose. A comparison would be made of the amino acid 
sequences of the exogenous receptor and of the yeast receptors, and regions of high and low 
homology identified. Trial mutations would then be made to distinguish regions involved in 
ligand or G protein binding, from those necessary for functional integration in the membrane. 
The exogenous receptor would then be mutated in the latter region to more closely resemble 
the yeast receptor, until functional integration was achieved. If this were insufficient to 
achieve functionality, mutations would next be made in the regions involved in G protein 
binding. Mutations would be made in regions involved in ligand binding only as a last resort, 
and then an effort would be made to preserve ligand binding by making conservative 
substitutions whenever possible. 

Preferably, the yeast genome is modified so that it is unable to produce the 
endogenous a- and a-factor receptors in functional form. Otherwise, a positive assay score 
might reflect the ability of a peptide to activate the endogenous G protein-coupled receptor, 
and not the receptor of interest. 



G protein 



When the PSP surrogate is an exogenous G protein-coupled receptor, the yeast cell 
must be able to produce a G protein which is activated by the exogenous receptor, and which 
can in turn activate the yeast effector(s). It is possible that the endogenous yeast Got subunit 
(e.g., GPA) will be sufficiently homologous to the "cognate" Ga subunit which is natively 
associated with the exogenous receptor for coupling to occur. More likely, it will be 
necessary to genetically engineer the yeast cell to produce a foreign Ga subunit which can 
properly interact with the exogenous receptor. For example, the Ga subunit of the yeast G 
protein may be replaced by the Ga subunit natively associated with the exogenous receptor. 

Dietzel and kurjan. Cell, 50:1001 (1987) demonstrated that rat Gas functionally 
coupled to the yeast Gpy complex. However, rat Gai2 complemented only when 
substantially overexpressed. while GaO did not complement at all. Kang, et al., Mol. Cell. 
Biol., 10:2582 (1990). Consequently, with some foreign Ga subunits, it is not feasible to 
simply replace the yeast Ga. 

If the exogenous G protein coupled receptor is not adequately coupled to yeast Gpy by 
the Ga subunit natively associated with the receptor, the Ga subunit may be modified to 
improve coupling. These modifications often will take the form of mutations which increase 
the resemblance of the Ga subunit to the yeast Ga while decreasing its resemblance to the 
receptor-associated Ga. For example, a residue may be changed so as to become identical to 
the corresponding yeast Ga residue, or to at least belong to the same exchange group of that 
residue. After modification, the modified Ga subunit might or might not be "substantially 
homologous" to the foreign and or the yeast Ga subunit. 

The modifications are preferably concentrated in regions of the Ga which are likely 
to be involved in GPy binding. In some embodiments, the modifications will take the form of 
replacing one or more segments of the receptor-associated Ga with the corresponding yeast G 
a segment(s), thereby forming a chimeric Ga subunit. (For the purpose of the appended 
claims, the term "segment" refers to three or more consecutive amino acids.) In other 
embodiments, point mutations may be sufficient. 

This chimeric Ga subunit will interact with the exogenous receptor and the yeast Gpy 
complex, thereby permitting signal transduction. While use of the endogenous yeast Gpy is 
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preferred, if a foreign or chimeric G(3y is capable of transducing the signal to the yeast 
effector, it may be used instead. 

Ga Structure 

We will now review information regarding Ga structure which is relevant to the 
design of modified Ga subunits. 

In Table 5. part A. the amino terminal 66 residues of GPA1 are aligned with the 
cognate domains of human Gas, Gai2. Gai3 and Gal 6. In part B. we present alignment of 
the amino-terminal 66 residues of GPA1 41 . Ga chimeras. In the GPA 4 , Ga hybrids, the amino 
terminal 41 residues (derived from GPA 1 ) are identical, end with the sequence- 
LEK.QRDKNE- and are underlined for emphasis. All residues following the glutamate (E) 
residue at position 41 are contributed by the human Ga subunits, including the consensus 
nucleotide binding motif -GxGxxG-. Periods in the sequences indicate gaps that have been 
introduced to maximize alignments in this region. Codon bias is mammalian. For alignments 
of the entire coding regions of GPA1 with Gas, Gai, and GaO. Gaq and Gaz, see Dietzel 
and Kurjan ( 1987) and Lambright, et al. ( 1994). Additional sequence information is provided 
by Mattera, et al. (1986). Bray, et al. (1986) and Bray, et al. 0987). 

The sequences are identified as follows: GPA1 (SEQ ID NO: 82); Gas (SEQ ID 
NO:83); Gai2 (SEQ ID NO:84); Gai3 (SEQ ID NO:85): Gal6 (SEQ ID NO:86): GPA41-Ga 
s (SEQ ID NO:87): GPA41-Gai2 (SEQ ID NO:88); GPA41-Gai3 (SEQ ID NO:89); and 
GPA41-Gal6 {SEQ ID NO:90). 

The gene encoding a G protein homolog of S. cerevisiae was cloned independently by 
Dietzel and Kurjan (1987) (SCG1) and by Nakafuku, et al. (1987) (GPA1). Sequence 
analysis revealed a high degree of homology between the protein encoded by this gene and 
mammalian Ga. GPA I encodes a protein of 472 amino acids, as compared with 
approximately 340-350 a. a. for most mammalian Ga subunits in four described families, Ga 
s, Gai, Gaq and Gal 2/1 3. Nevertheless, GPA1 shares overall sequence and structural 
homology with all Ga proteins identified to date. The highest overall homology in GPA1 is 
to the Gai family (48% identity, or 65°o with conservative substitutions) and the lowest is to 
Gas (33% identity, or 51% with conservative substitutions) (Nakafuku. et al. 1987). 
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The regions of high sequence homology among Got subunits are dispersed throughout 
their primary sequences, with the regions sharing the highest degree of homology mapping to 
sequence that comprises the guanine nucleotide binding GTPase domain. This domain is 
structurally similar to the ap fold of ras proteins and the protein _ synthesis elongation factor 
EF-Tu. This highly conserved guanine nucleotide-binding domain consists of a six-stranded 
(3 sheet surrounded by a set of five ct-helices. It is within these p sheets and a helices that the 
highest degree of conservation is observed among all Got proteins, including GPA1. The 
least sequence and structural homology is found in the intervening loops between the p sheets 
and a helices that define the core GTPase domain. There are a total of four "intervening 
loops" or "inserts" present in all Get subunits. In the cry stal structures reported to date for the 
GDP- and GTPyS-liganded forms of bovine rod transducin (Noel, et al. 1993: Lambright, et 
al. 1994), the loop residues are found to be outside the core GTPase structure. Functional 
roles for these loop structures have been established in only a few instances. A direct role in 
coupling to phosphodiesterase-y has been demonstrated for residues within inserts 3 and 4 of 
Gat (Rarick, et al. 1992; Artemyev, et al. 1992), while a "GAP-like" activity has been 
ascribed to the largely a-helical insert 1 domain of GaS (Markby, et al. 1993). 

While the amino- and carboxy-termini of Got subunits do not share striking homology 
either at the primary, secondary, or tertiary levels, there are several generalizations that can be 
made about them. First, the amino termini of Got subunits have been implicated in the 
association of Ga with Gpy complexes and in membrane association via N-terminal 
myristoylation. In addition, the carboxy-termini have been implicated in the association of G 
ctpy heterotrimeric complexes with G protein-coupled receptors (Sullivan, et al. 1987; West, 
et al. 1985; Conklin, et al. 1993). Data in support of these generalizations about the function 
of the N-terminus derive from several sources, including both biochemical and genetic 
studies. 

As indicated above, there is little if any sequence homology shared among the amino 
termini of Ga subunits. The amino terminal domains of Ga subunits that precede the first p- 
sheet (containing the sequence motif -LLLLGAGESG-; see Noel, et al. (1993) for the 
numbering of the structural elements of Ga subunits) vary in length from 41 amino acids 
(GPA1) to 31 amino acids (Gat). Most Ga subunits share the consensus sequence for the 
addition of myristic acid at their amino termini (MGxxxS-), although not all Ga subunits that 



43 



contain this motif have myristic acid covalently associated with the glycine at position 2 
(speigel, et al. 1991 ). The role of this post-translational modification has been inferred from 
studies in which the activity of mutant Got subunits from which the consensus sequence for 
myristoylation has been added or deleted has been assayed (Mumby, et al. 1990; Linder, et al. 
1991; Gallego. et al. 1992). These studies suggest two roles for N-terminal myristoylation. 
First, the presence of amino-terminal myristic acid has in some cases been shown to be 
required for association of Get subunits with the membrane, and second, this modification has 
been demonstrated to play a role in modulating the association of Got subunits with Gpy 
complexes. The role of myristoylation of the GPA1 gene product is. at present, unknown. 

In other biochemical studies aimed at examining the role of the amino-terminus of Get 
in driving the association between Got and Gpy subunits, proteolytically or genetically 
truncated versions of Ga subunits were assayed for their ability to associate with GPy 
complexes, bind guanine nucleotides and/or to activate effector molecules. In all cases, Ga 
subunits with truncated amino termini were deficient in all three functions (Graf, et al. 1992; 
Journot, et al. 1990; and Neer, et al. 1988). Slepak, et al. (1993) reported a mutational 
analysis of the N-terminal 56 a. a. of mammalian Goto expressed in Escherichia coli . 
Molecules with an apparent reduced ability to interact with exogenously added mammalian G 
py were identified in the mutant library. As the authors pointed out, howeve, the assay used 
to screen the mutants—the extent of ADP-ribosylation of the mutant Ga by pertussis toxin- 
was not a completely satisfactory probe of interactions between Ga and GPy. Mutations 
identified as inhibiting the interaction of the subunits, using this assay, may still permit the 
complexing of Ga and Gpy while sterically hindering the ribosylation of Ga by toxin. 

Genetic studies examined the role of amino-terminal determinants of Ga in 
heterotrimer subunit association have been carried out in both yeast systems using GPA1- 
mammalian Ga hybrids (Kang, et al. 1990) and in mammalian systems using Gai/Gas 
hybrids (Russell and Johnson 1993). In the former studies, gene fusions, composed of yeast 
GPA1 and mammalian Ga sequences were constructed by Kang, et al. ( 1990) and assayed for 
their ability to complement a gpal null phenotype (i.e., constitutive activation of the 
pheromone response pathway) in cerevisiae . Kang, et al. demonstrated that wild type 
mammalian Gas. Gai but not Gao proteins are competent to associate with yeast GPy and 
suppress the gpal null phenotype, but only when overexpressed. Fusion proteins containing 
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the ami no- terminal 330 residues of GPA1 sequence linked to 160. 143. or 142 residues of the 
mammalian Gas, Gai and Goto carbox> 1-terminal regions, respectively, also coupled to the 
yeast mating response pathway when overexpressed on high copy plasmids with strong 
inducible (CUP) or constitutive (PGK) promoters. All three of these hybrid molecules were 
able to complement the gpal null mutation in a growth arrest assay, and were T additionally 
able to inhibit a-factor responsiveness and mating in tester strains. These last two 
observations argue that hybrid yeast-mammalian Got subunits are capable of interacting 
directly with yeast G(3y, thereby disrupting the normal function of the yeast heterotrimer. 
Fusions containing the amino terminal domain of Gas, Gai or Gao, however, did not 
complement the gpal null phenotype, indicating a requirement for determinants in the amino 
terminal 330 amino acid residues of GPA1 for association and sequestration of yeast G(3y 
complexes. Taken together, these data suggest that determinants in the amino terminal region 
of Ga subunits determine not only the ability to associate with Gpy subunits in general, but 
also with specific G(3y subunits in a species-restricted manner. 

Hybrid Gai/Gas subunits have been assayed in mammalian expression systems 
(Russell and Johnson 1993). In these studies, a large number of chimeric Ga subunits were 
assayed for an ability to activate adenylyl cyclase, and therefore, indirectly, for an ability to 
interact with Gpy (i.e., coupling of Ga to Gpy = inactive cyclase; uncoupling of Ga from Gpy 
= active cyclase). From these studies a complex picture emerged in which determinants in 
the region between residues 25 and 96 of the hybrids were found to determine the state of 
activation of these alleles as reflected in their rates of guanine nucleotide exchange and GTP 
hydrolysis and the extent to which they activated adenylyl cyclase in vivo . These data could 
be interpreted to support the hypothesis that structural elements in the region between the 
amino terminal methionine and the pi sheet identified in the crystal structure of Gat (see 
Noel, et al. 1993 and Lambright, et al. 1994) are involved in determining the state of activity 
of the heterotrimer by (1) driving association/dissociation between Ga and Gpy subunits; (2) 
driving GDP/GTP exchange. While there is no direct evidence provided by these studies to 
support the idea that residues in this region of Ga and residues in Gpy subunits contact one 
another, the data nonetheless provide a positive indication for the construction of hybrid Ga 
subunits that retain function. There is, however, a negative indicator that derives from this 
work in that some hybrid constructs resulted in constitutive activation of the chimeric 
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proteins (i.e., a loss of receptor-dependent stimulation of Gapv dissociation and effector 
activation). 

Construction of chimeric Got subunits. 

In designing Get subunits capable of transmitting, in yeast, signals originating at 
mammalian G protein-coupled receptors, two general desiderata were recognized. First, the 
subunits should retain as much of the sequence of the native mammalian proteins as possible. 
Second, the level of expression for the heterologous components should approach, as closely 
as possible, the level of their endogenous counterparts. The results described by King, et al. 
(1990) for expression of the human p2-adrenergic receptor and Gas in yeast, taken together 
with negative results obtained by Rang, et al. ( 1990) with full-length mammalian Got subunits 
other than Gas, led us to the following preferences for the development of yeast strains in 
which mammalian G protein-coupled receptors could be linked to the pheromone response 
pathway. 

1. Mammalian Ga subunits will be expressed using the native sequence of each 
subunit or, alternatively, as minimal gene fusions with sequences from the 
amino terminus of GPA1 replacing the homologous residues from the 
mammalian Ga subunits. 

2. Mammalian Ga subunits will be expressed from the GPA 1 promotor either on 
low copy plasmids or after integration into the yeast genome as a single copy 
gene. 

3. Endogenous Gpy subunits will be provided by the yeast STE4 and STE18 loci. 

Site-Directed Mutagenesis versus Random Mutagenesis 

There are two general approaches to solving structure-function problems of the sort 
presented by attempts to define the determinants involved in mediating the association of the 
subunits that comprise the G protein heterotrimer. The first approach, discussed above with 
respect to hybrid constructs, is a rational one in which specific mutations or alterations are 
introduced into a molecule based upon the available experimental evidence . In a second 
approach, random mutagenesis techniques, coupled with selection or screening systems, are 
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used to introduce large numbers of mutations into a molecule, and that collection of randomly 
mutated molecules is then subjected to a selection for the desired phenotype or a screen in 
which the desired phenotype can be observed against a background of undesirable 
phenotypes. With random mutagenesis one can mutagenize an entire molecule or one can 
proceed by cassette mutagenesis. In the former instance, the entire coding region of a 
molecule is mutagenized by one of several methods (chemical, PCR, doped oligonucleotide 
synthesis) and that collection of randomly mutated molecules is subjected to selection or 
screening procedures. Random mutagenesis can be applied in this way in cases where the 
molecule being studied is relatively small and there are powerful and stringent selections or 
screens available to discriminate between the different classes of mutant phenotypes that will 
inevitably arise. In the second approach, discrete regions of a protein, corresponding either to 
defined structural (i.e. a-helices, b -sheets, turns, surface loops) or functional determinants 
(e.g., catalytic clefts, binding determinants, transmembrane segments) are subjected to 
saturating or semi-random mutagenesis and these mutagenized cassettes are re-introduced 
into the context of the otherwise wild type allele. Cassette mutagenesis is most useful when 
there is experimental evidence available to suggest a particular function for a region of a 
molecule and there is a powerful selection and/or screening approach available to 
discriminate between interesting and uninteresting mutants. Cassette mutagenesis is also 
useful when the parent molecule is comparatively large and the desire is to map the functional 
domains of a molecule by mutagenizing the molecule in a step-wise fashion, i.e. mutating one 
linear cassette of residues at a time and then assaying for function. 

We are applying random mutagenesis in order to further delineate the determinants 
involved in Gct-Gpy association. Random mutagenesis may be accomplished by many 
means, including: 

1. PCR mutagenesis, in which the error prone Taq polymerase is exploited to 
generate mutant alleles of Got subunits, which are assayed directly in yeast for 
an ability to couple to yeast Gpy. 

2. Chemical mutagenesis, in which expression cassettes encoding Ga subunits 
are exposed to mutagens and the protein products of the mutant sequences are 
assayed directly in yeast for an ability to couple to yeast Gpy. 

3. Doped synthesis of oligonucleotides encoding portions of the Ga gene. 
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4. In vivo mutagenesis, in which random mutations are introduced into the 
coding region of Got subunits by passage through a mutator strain of coli . 
XLl-Red (mutD5 mutS mutT) (Stratagene, Menasa, \VI). 



The random mutagenesis may be focused on regions suspected to be involved in Goc- 
Gpy association as discussed in the next section. Random mutagenesis approaches are 
feasible for two reasons. First, in yeast one has the ability to construct stringent screens and 
facile selections (growth vs. death, transcription vs. lack of transcription) that are not readily 
available in mammalian systems. Second, when using yeastit is possible to screen effienctly 
through thousands of transformants rapidly. 

Cassette mutagenesis is immediately suggested by the observation (see infra ) that the 
GPA 41 hybrids couple to the pheromone response pathway. This relatively small region of G 
a subunits represents a reasonable target for this type of mutagenesis. Another region that 
may be amenable to cassette mutagenesis is that defining the surface of the switch region of G 
a subunits that is solvent-exposed in the crystal structures of Gail and transducin. From the 
data described below, this surface may contain residues that are in direct contact with yeast G 
py subunits, and may therefore be a reasonable target for mutagenesis. 

Rational Design of Chimeric Ga Subunits 

Several classes of rationally designed GPA 1 -mammalian Ga hybrid subunits have 
been tested for the ability to couple to yeast Py. The first, and largest, class of hybrids are 
those that encode different lengths of the GPA1 amino terminal domain in place of the 
homologous regions of the mammalian Ga subunits. This class of hybrid molecules includes 
GPA BamH1 , GPA 4l , GPA 1D , and GPA LW hybrids, described below. The rationale for 
constructing these hybrid Ga proteins is based on results, described above, that bear on the 
importance of the amino terminal residues of Ga in mediating interaction with Gpy. 

Preferably, the yeast Ga subunit is replaced by a chimeric Ga subunit in which a 
portion, e.g., at least about 20, more preferably at least about 40, amino acids, which is 
substantially homologous with the corresponding residues of the amino terminus of the yeast 
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Ga, is fused to a sequence substantially homologous with the main body of a mammalian (or 
other exogenous) Ga. W hile 40 amino acids is the suggested starting point, shorter or longer 
portions may be tested to determine the minimum length required for coupling to yeast Gpy 
and the maximum length compatible with retention of coupling to the exogenous receptor. It 
is presently believed that only the final 10 or 20 amino acids at the carboxy terminus of the G 
a subunit are required for interaction with the receptor. 

GPABamHi hybrids . Rang et al. (1990) described hybrid Ga subunits encoding the amino 
terminal 310 residues of GPA1 fused to the carboxyl terminal 160, 143 and 142 residues, 
respectively, of GaS, Gai2. and Goto. In all cases examined by Rang et al., the hybrid 
proteins were able to complement the growth arrest phenotype of gpal strains. We have 
confirmed these findings and, in addition, have constructed and tested hybrids between GPA1 
and Gai3, Gaq and Gal 6 (see Example 1 1). All hybrids of this type that have been tested 
functionally complement the growth arrest phenotype of gpal strains. 

GPA_m hybrids . The rationale for constructing a minimal hybrid encoding only 41 

amino acids of GPA1 relies upon the biochemical evidence for the role of the amino-terminus 
of Ga subunits discussed above, together with the following observation. Gp and Gy 
subunits are known to interact via a- helical domains at their respective amino-termini 
(Pronin, et al. 1992; Garritsen, et al. 1993). The suggestion that the amino termini of Ga 
subunits may form an helical coil and that this helical coil may be involved in association of 
Ga with Gpy (Masters, Stroud, and Bourne 1986; Lupas, Lupas and Stock 1992) leads to the 
hypothesis that the three subunits of the G-protein heterotrimer interact with one another 
reversibly through the winding and unwinding of their amino-terrninal helical regions. A 
mechanism of this type has been suggested, as well, from an analysis of leucine zipper 
mutants of the GCN4 transcription factor (Harbury, et al. 1993). The rationale for 
constructing hybrids like those described by Kang, et al. (1990), that contain a majority of 
yeast sequence and only minimal mammalian sequence, derives from their ability to function 
in assays of coupling between Ga and Gpy subunits. However, these chimeras had never 
been assayed for an ability to couple to both mammalian G protein-coupled receptors and 
yeast Gpy subunits, and hence to reconstitute a hybrid signalling pathway in yeast. 



GPA 4I hybrids that have been constructed and tested include Gas, Gai2, Gai3, Gaq. Gao 3 , G 
ao b and Gal 6 (see Example 11). Hybrids of Gas, Gai2, Gai3, and Gal 6 functionally 
complement the growth arrest phenotype of gpal strains, while GPA 4] hybrids of Gao a and G 
ao b do not. In addition to being tested in a growth arrest assay, these constructs have been 
assayed in the more sensitive transcriptional assay for activation of a fuslp-HIS3 gene. In 
both of these assays, the GPA 41 -Gas hybrid couples less well than the GPA 4! -i2, -i3, and -16 
hybrids, while the GPA 41 -o a and -o b hyrids do not function in either assay. 

Several predictive algorithms indicate that the amino terminal domain up to the highly 
conserved sequence motif-LLLLGAGESG- (SEQ ID NO: 120) (the first L in this motif is 
residue 43 in GPA1) forms a helical structure with amphipathic character. Assuming that a 
heptahelical repeat unit, the following hybrids between GPA 1 and GaS can be used to define 
the number of helical repeats in this motif necessary for hybrid function: 

GPAl-7/Gas8-394 
GPA 1-14/Gas 15-394 
GPAl-21/Gas22-394 
GPA 1-28 Gas29-394 
GPAl-35'Gas36-394 
GPAl-42Gas43-394 

In this hybrids, the prediction is that the structural repeat unit in the amino terminal 
domain up to the tetra-Ieucine motif is 7, and that swapping sequences in units of 7 will in 
effect amount to a swap of unit turns of turns of the T helical structure that comprises this 
domain. 
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A second group of "double crossover" hybrids of this class are those that are aligned 
on the first putative heptad repeat beginning with residue Gil in GPA1. In these hybrids, 
helical repeats are swapped from GPA1 into a GaS backbone one heptad repeat unit at a time. 

GaSl-lOGPAl 1-17 Gas! 8-394 
GaSl-17 GPA 1 8-24 GaS25-394 
GaSl-17GPA25-3rGaS32-394 
GaS 1 - 1 7 GP A3 2-3 8 GaS39-394 

The gap that is introduced between residues 9 and 10 in the GaS sequence is to 
preserve the alignment of the -LLLLGAGE-sequence moti f (SEQ ID NO: 121) . 

This class of hybrids can be complemented by cassette mutagenesis of each heptad 
repeat followed by screening of these collections of "heptad" libraries in standard coupling 
assays. 

A third class of hybrids based on the prediction that the amino terminus forms a 
helical domain with a heptahelical repeat unit are those that effect the overall hydrophobic or 
hydrophilic character of the opposing sides of the predicted helical structure (See Lupas. 
Stock and Stock). In this model, the a and d positions of the heptad repeat abcdefg are found 
to be conserved hydrophobic residues that define one face of the helix, while the e and g 
positions define the charged face of the helix. 

In this class of hybrids, the sequence of the GaS parent is maintained except for 
specific substitutions at one or more of the follow ing critical residues to render the different 
helical faces of GaS more "GPA 1 -like" 
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K8Q 

+ 1-10 

E10G 

Q12E 

R13S 

N14D 

E15P 

E15F 

K17L 

E21R 

K28Q 

K32L 

V36R 

This collection of single mutations could be screened for coupling .efficiency to yeast 
GPy and then constructed in combinations (double and greater if necessary). 

A fourth class of hybrid molecules that span this region of GPAl-Ga hybrids are 
those that haye junctions between GPA1 and Ga subunits introduced by three primer PGR. 
In this approach, the two outside primers are encoded by sequences at the initiator methionine 
of GPA1 on the 5' side and at the tetraleucine motif of GaS (for example) on the 3' side. A 
series of junctional primers spanning different junctional points can be mixed with the outside 
primers to make a series of molecules each with different amounts of GPA1 and GaS 
sequences, respectively. 



GPA |n and GPA, w hybrids . The regions of high homology among Ga subunits that have 
been identified by sequence alignment are interspersed throughout the molecule. The Gl 
region containing the highly conserved -GSGESGDST- motif is followed immediately by a 
region of very low sequence consevation, the "iF or insert 1 region. Both sequence and 
length vary considerably among the il regions of the Ga subunits. By aligning the sequences 
of Ga subunits, the conserved regions bounding the il region were identified and two 
additional classes of GPAl-Ga hybrids were constructed. The GPA in hybrids encode the 
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amino terminal 102 residues of GPA1 (up to the sequence -GARKLGIP- (SEP ID NO:122) ) 
fused in frame to mammalian Get subunits, while the GPA LU hybrids encode the amino 
terminal 244 residues of GPA1 (up to the sequence -LIHEDIAK.A- (SEP ID NO:123) in 
GPA 1 ). The reason for constructing the GPA, D and GPA LW hybrids was to test the hypothesis 
that the il region of GPA lis required for mediating the interaction of GPA1 with yeast GfSy 
subunits, for the stable expression of the hybrid molecules, or for function of the hybrid 
molecules. The GPA lD hybrids contain the amino terminal domain of GPA1 fused to the il 
domain of mammalian subunits, and therefore do not contain the GPA1 il region, while the 
GPA LW hybrids contain the amino terminal 244 residues of GPA1 including the entire il 
region (as defined by sequence alignments). Hybrids of both GPA 1D and GPA LAV classes were 
constructed for GaS, Gai2, Gai3, Gao a , and Gal 6; none of these hybrids complemented the 
gpaJ growth arrest phenotype. 

Subsequent to the construction and testing of the GPA 1D and GPA 1W classes of 
hybrids, the crystal structures of Gct^^,,, in both the GDP and GTPyS-liganded form, and 
the crystal structure of several Gail variants in the GTPyS-liganded and GDP-AIF 4 forms 
were reported (Noel et al. 1993; Lambright et al. 1994 and Coleman et al. 1994). The crystal 
structures reveal that the il region defined by sequence alignment has a conserved structure 
that is comprised of six alpha helices in a rigid array, and that the junctions chosen for the 
construction of the GPA 1D and GPA LW hybrids were not compatible with conservation of the 
structural features of the il region observed in the crystals. The junction chosen for the 
GPAj D hybrids falls in the center of the long aA helix; chimerization of this helix in all 
likelihood destabilizes it and the protein structure in general. The same is true of the junction 
chosen for the GPA lw hybrids in which the crossover point between GPA1 and the 
mammalian Ga subunit falls at the end of the short aC helix and therefore may distort it and 
destabilize the protein. 

The failure of the GPA !D and GPA LW hybrids is predicted to be due to disruption of 
critical structural elements in the il region as discussed above. Based upon new alignments 
and the data presented in Noel et al ( 1993 ), Lambright et al (1994), and Coleman et al (1994). 
this problem can be averted with the following hybrids. In these hybrids, the junctions 
between the ras-Iike core domain and the il helical domain are introduced outside of known 
structural elements like alpha-helices. 
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Hybrid A GaS 1 -67 GPA66-299 GaS203-394 

This hybrid contains the entire il insert of GPA 1 interposed into the GaS sequence. 
Hybrid B GPA 1-4 1 / GaS4443-67 GPA66-299/GaS203-394 

This hybrid contains the amino terminal 41 residues of GPA1 in place of the 42 amino 
terminal residues of GaS found in Hybrid A. 

Gas Hybrids . There is evidence that the "switch region" encoded by residues 171-237 of Ga 
transducin (using the numbering of Noel et al (1993)) also plays a role in Gpy coupling. 
First, the G226A mutation in GaS (Miller et al. 1988) prevents the GTP-induced 
conformational change that occurs with exchange of GDP for GTP upon receptor activation 
by ligand. This residue maps to the highly conserved sequence -DVGGQ-, present in all Ga 
subunits and is involved in GTP hydrolysis. In both the Gat and Gail crystal structures, this 
sequence motif resides in the loop that connects the p3 sheet and the a2 helix in the guanine 
nucleotide binding core. In addition to blocking the conformational change that occurs upon 
GTP binding, this mutation also prevents dissociation of GTP-liganded Gas from Gpy. 
Second, crosslinking data reveals that a highly conserved cysteine residue in the a2 helix 
(C215 in Gao, C210 in Gat) can be crosslinked to the carboxy terminal region of Gp 
subunits. Finally, genetic evidence (Whiteway et al. 1993) identifies an important single 
residue in GPA1 (E307) in the p2 sheet of the core structure that may be in direct contact 
with Py. A mutation in the GPA1 protein at this position suppresses the constitutive 
signalling phenotype of a variety of STE4 (GP) dominant negative mutations that are also 
known to be defective in Ga-Gpy association (as assessed in two-hybrid assay in yeast as 
well as by more conventional genetic tests). 

We have tested the hypothesis that there are switch region determinants involved in 
the association of Ga with Gpy by constructing a series of hybrid Ga proteins encoding 
portions of GPA 1 and GaS in different combinations (Figure 1 1). The hybrid proteins were 
tested in both biological assay described above and the results are summarized in Table 6. 

Two conclusions may be drawn. First, in the context of the amino terminus of GaS, 
the GPA1 switch region suppresses coupling to yeast Gpy (SGS), while in the context of the 
GPA1 amino terminus the GPAl switch region stabilizes coupling with Gpy (GPA 4! -SGS). 
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This suggests that these two regions of GPA1 collaborate to allow interactions between Get 
subunits and GPy subunits. This conclusion is somewhat mitigated by the observation that 
the GPA 4] -Gas hybrid that does not contain the GPA1 switch region is able to complement 
the growlh arrest phenotype of gpal strains. We have not to date noted a quantitative 
difference between the behaviour of the GPA 41 -Gas allele and the GPA 41 -SGS allele, but if 
this interaction is somewhat degenerate, then it may be difficult to quantitate this accurately. 
The second conclusion that can be drawn from these results is that there are other 
determinants involved in stabilizing the interaction of Got with G(3y beyond these two regions 
as none of the GPAl/Gas hybrid proteins couple as efficiently to yeast Gpy as does native 
GPAl. 

The role of the surface-exposed residues of this region may be crucial for effective 
coupling to yeast Gpy, and can be incorporated into hybrid molecules as follows below. 
GoS-GPA-Switch GaS l-202/GPA298-350/GaS 253-394 

This hybrid encodes the entire switch region of GPA 1 in the context of GaS. 
GaS-GPA-a2 GaS l-226/GPA322-332/GaS 238-394 

This hybrid encodes the a 2 belix of GPAl in the context of GaS. 
GPA41-GoS-GPA-o2 GPA 1 -4 l/GaS43-226/GPA322-332/GaS238-394 

This hybrid encodes the 41 residue amino terminal domain of GPAl and the a2 helix 
of GPA 1 in the context of GaS. 

Finally, the last class of hybrids that will be discussed here are those that alter the 
surface exposed residues of the p2 and p3 sheets of aS so that they resemble those of the 
GPAl as helix. These altered a2 helical domains have the following structure. (The 
positions of the altered residues correspond to GaS.) 

L203K 
K211E 
D215G 
K216S 
D229S 

These single mutations can be engineered into a GaS backbone singly and in pairwise 

combinations. In addition, they can be introduced in the context of both the full length GaS 
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and the GPA 41 -GaS hybrid described previously. All are predicted to improve the coupling 
of Ga subunits to yeast Gpy subunits by virtue of improved electrostatic and hydrophobic 
contacts between this region and the regions of Gp defined by Whiteway and co-workers 
(Whiteway et al ( 1994) that define site(s) that interact with GPA11. 

Summary . Identification of hybrid Ga subunits that couple to the yeast pheromone 

pathway has led to the following general observations. First, all GPA B;imH] hybrids associate 
with yeast GPy. therefore at a minimum these hybrids contain the determinants in GPA1 
necessary for coupling to the pheromone response pathway. Second, the amino terminal 41 
residues of GPA1 contain sufficient determinants to facilitate coupling of Ga hybrids to yeast 
GPy in some, but not all, instances, and that some Ga subunits contain regions outside of the 
first 41 residues that are sufficiently similar to those in GPA1 to facilitate interaction with 
GPA1 even in the absence of the amino terminal 41 residues of GPA1. Third, there are other 
determinants in the first 310 residues of GPA1 that are involved in coupling Ga subunits to 
yeast Gpy subunits. 

The vavious classes of hybrids noted above are not mutually exclusive. For example, 
a Ga containing GPA1-41 could also feature the L203K mutation. 

While, for the sake of simplicity, we have described hybrids of yeast GPA1 and a 
mammalian Gas, it will be appreciated that hybrids may be made of other yeast Ga subunits 
and/or other mammalian Ga subunits, notably mammalian Gai subunits. Moreover, while 
the described hybrids are constructed from two parental proteins, hybrids of three or more 
parental proteins are also possible. 

As shown in the Examples, chimeric Ga subunits have been especially useful in 
coupling receptors to Gai species. 

Expression of Ga 

Kang et al. (1990) reported that several classes of native mammalian Ga subunits 

were able to interact functionally with yeast py subunits when expression of Ga was driven 
from a constitutive!}' active, strong promotor (PGK) or from a strong inducible promotor 
(CUP). These authors reported that rat GaS, Gai2 or Gao expressed at high level coupled to 
yeast py. High level expression of mammalian Gaji.e. non-stoichiometric with respect to 
yeast py) is not desirable for uses like those described in this application. Reconstruction of 
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G protein-coupled receptor signal transduction in yeast requires the signalling component of 
the heterotrimeric complex (Gpy) to be present stoichiometrically with Ga subunits. An 
excess of Ga subunits (as was required for coupling of mammalian Gai2 and Goto to yeast G 
py_in Kang et al.) would dampen the signal in systems where Gpy subunits transduce the 
signal. An excess of G(3y subunits raises the background level of signalling in the system to 
unacceptably high levels. 

Preferably, levels of Ga and Gpy subunits are balanced. For example, heterologous G 
a subunits may be expressed from a low copy (CEN ARS) vector containing the endogenous 
yeast GPA1 promotor and the GPA1 3' untranslated region. The minimum criterion, applied 
to a heterologous Ga subunit with respect to its ability to couple functionally to the yeast 
pheromone pathway, is that it complement a gpal genotype when expressed from the GPA1 
promoter on low copy plasmids or from an integrated, single copy gene. In the work 
described in this application, all heterologous Ga subunits have been assayed in two 
biological systems. In the first assay heterologous Ga subunits are tested for an ability to 
functionally complement the growth arrest phenotype of gpal strains. In the second assay the 
transcription of a fusl-HIS3 reporter gene is used to measure the extent to which the 
pheromone response pathway is activated, and hence the extent to which the heterologous Ga 
subunit sequesters the endogenous yeast GPy complex. 

Mammalian Gas, Gai2, Gai3, Gaq, Gall, Gal 6, Gao a , Gao b , and Gaz from rat, 
murine or human origins were expressed from a low copy, CEN ARS vector containing the 
GPA1 promoter. Functional complementation of gpal strains was not observed in either 
assay system with any of these full-length Ga constructs w ith the exception of rat and human 
GaS. 

Chimeric Yeast By subunits 

An alternative to the modification of a mammalian Ga subunit for improved signal 
transduction is the modification of the pertinent sites in the yeast Gp or Gy subunits. The 
principles discussed already with respect to Ga subunits apply, mutatis mutandis , to yeast Gp 
or Gy. 

For example, it would not be unreasonable to target the yeast Ste4p Gp subunit with 
cassette mutagenesis. Specifically, the region of Ste4p that encodes several of the dominant 
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negative, signalling-defective mutations would be an excellent target for cassette mutagenesis 
when looking for coupling of yeast Gpy to specific mammalian Get subunits. 

Protein kinases 

Mitogen-activated protein kinase (MAP kinase) and its activator, MAP kinase kinase 
or MEK, are believed to be key molecules in the transduction of intracellular signals in 
mammalian cells. The activity of MAPK, a serine/threonine protein kinase, has been shown 
to depend on its phosphorylation by the dually specific MEK at tyrosine and threonine 
residues. MEK activity, in turn, depends on its phosphorylation on serine and threonine by a 
third kinase, MAP kinase kinase kinase, or MEKK, whose function in some systems is 
fulfilled by the protooncogene product Raflp. 

An essential part of the S. cerexisiae pheromone signalling pathway is comprised of a 
protein kinase cascade composed of the products of the ST EI L STE~, and FLS3/KSSJ genes 
(the latter pair are distinct and functionally redundant). Functional studies have established 
the dependence of FUS3p activity on tyrosine and threonine phosphorylation by STE7p, 
whose activity is regulated by its phosphorylation by STEllp. A second protein kinase 
cascade, responsive to protein kinase C, has been identified in S. cerexisiae. When this 
pathway is disrupted, yeast cells lose their ability to grow in media of low osmotic strength. 
Although its components have not been characterized to the same extent as that of the mating 
pathway cascade, sequence analysis identifies BCKlp as a MEKK, MKKlp/MKK2p as 
MEKs, and MPKlp as a MAPK. 

Kinase signalling pathways appear to be conserved among eukaryotes. Thus 
significant sequence homology is found between Xenopus MAP kinase and the products of 
the following yeast kinase genes: FUS3, KSS1, MPK1 {S. cerexisiae) and Spkl {Schizo- 
saccharomyces pontbe). In addition, mammalian MEK has been found to be homologous to 
the products of STE7 {S. cerexisiae) and Byrl (S. pom be) [Crews et al. Science 258, 478 
(1992)]. Functional homologies between some kinases has been demonstrated through 
substitution of heterologous kinase genes in yeast kinase deletion mutants. Thus Xenopus 
MAP kinase will complement an mpklA mutant in S cerexisiae (however, this kinase will 
not substitute for Fus3p or Ksslp function in the same organism) [Lee et al. Mo!. Cell. Biol. 
13, 3067 (1993)]. Both mammalian and Xenopus MAP kinase will substitute for Spkl 
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function in S pombe [Neiman et al. Molec. Biol. Cell. 4, 107 (1993); Gotoh et al. Molec. 
Cell. Biol. 13, 6427 (1993)]. Rabbit MAP kinase kinase will complement a byrl defect in S 
pombe but only when co-expressed with Rafl kinase; the latter thus appears to be a direct 
activator of MEK (Hughes et al. Nature 364, 349 (1993). 

The use of the instant invention to screen for modulators of human MEK is described 
in Example 9. 

Cyclins 

Members of another mammalian protein kinase family, one active in progression 
through the cell cycle, have been identified by complementation of cell cycle kinase mutants 
in yeast. The human homologue of p34cdc2 {S. pombe) and p34cdc28 (5. cerexisiae), 
proteins which control the progression to DNA synthesis and mitosis in yeast, was identified 
by complementation of a cdc2 mutation in S. pombe [Lee and Nurse, Nature 327, 31-35 
(1987)]. CDK2, a second human p34 homologue, was identified by functional 
complementation of p34cdc28 mutations in S. cerexisiae [El ledge and Spottswood, EMBO J. 
10, 2653 (1991)]. Activation of p34 depends on its association with regulatory subunits, 
termed cyclins. Tight control of cyclin expression as well as the inherent instability of these 
proteins once expressed contribute to a regulated activation of p34 kinase and progression 
through the cell cycle. 

A number of putative Gl human cyclins have been identified through their ability to 
substitute for the yeast Gl cyclins, CLN1, CLN2 and CLN3, in the pheromone signalling 
pathway. Thus human cyclins C, D and E were identified by their ability to rescue cln yeast 
from growth arrest [Lew et al., Cell 66, 1197 (1991)]. It has also been demonstrated that 
other classes of human cyclins can substitute functionally for the CLN proteins in yeast. The 
human cyclins A, Bl and B2 (cyclins normally associated with governance of mitosis) will 
also function as Gl cyclins in 5 cerexisiae [Lew et al., Cell 66, 1 197 (1991)]. 

Certain cyclins are periodically accumulating regulators of the p34 kinase (p34cdc2 in 
humans and p34cdc28 in S. cerexisiae). The p34 kinase is functionally conserved from yeast 
to humans and the activity of this threonine/serine-specific protein kinase is required if cells 
are to progress through several checkpoints in the cell cycle, including one that is termed 
START. START occurs late in the Gl phase and is the switch point for cells between the 
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quiescent state and proliferation. The kinase is activated at discrete points in the cell cycle 
through its association with specific cyclins, regulatory subunits which are also functionally 
conserved among eukaryotes. Three cyclins appear to operate in progression through START 
in S cerevistae, CLN I , CLN2 and CLN3 (Hadwiger et al. 1989; Cross 1988; Nash 1988). 
The sequences of the CLN proteins bear some homology to those of mammalian A- and B- 
type cyclins; these proteins are believed to regulate S (DNA synthesis) and M (mitotic) 
phases of the mammalian cell cycle. 

Sequence comparisons among the cyclin proteins identified in different species 
indicates that a region of high sequence conservation is contained within an approximately 87 
residue domain that generally comprises central cyclin sequence but which is located near the 
amino terminus of the yeast Gl cy clins. This conserved domain is termed the "cyclin box". 
A second region of homology shared by most of the cyclins is a C-terminal sequence rich in 
proline, glutamate, serine, threonine and aspartate residues flanked by basic amino acids that 
is termed a PEST motif (Rogers et al. 1986). PEST motifs are found in unstable proteins and 
are believed to signal for constitutive ubiquitin-mediated degradation. The degradation of 
cyclins A and B is signalled via a different sequence, a "mitotic destruction motif (Glotzer et 
al. 1991 ), that is not shared by other mammalian cyclins. 

Sequence comparisons made between the yeast CLN proteins and the human A, B, C, 
D and E cyclins indicate the existence of appreciable homologies (Lew et al. 1991). Across 
the most conserved regions, including the cyclin box, human cyclin C bears 18% sequence 
identity both to human cyclins D and E and to the yeast CLN proteins. Human cyclins D and 
E appear to be more related to human A- and B-type cyclins (33% identical) than to the yeast 
CLNs (24% identical). 

All human cyclins identified to date will substitute functionally for yeast cyclins. In 
fact, the mammalian cyclins C, Dl and E were identified through their ability to complement 
defective CLN function in yeast (Lew et al. 1991). Mammalian A- and B-type cyclins also 
substitute functionally for the CLN proteins in yeast, therefore this ability cannot definitively 
mark a mammalian cyclin as one that would operate in Gl. However, the cyclins C, Dl and 
E have been shown to be expressed in Gl in mammalian cells and the expression pattern of 
cyclin E during the cell cycle most closely parallels the expression patterns observed for the 
yeast Gl cyclins (Lew et al. 1 991 ; Koff et al. 1991). 
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In mammalian cells, cyclin C mRNA accumulates early in Gl while cyclin E 
accumulates late in that phase. D cyclin mRNA levels are insufficient to allow tracking of 
expression patterns in human cells and the role of this cyclin is therefore not clear (Lew et al. 
1991). In mouse cells, the Dl gene, CYL1, is expressed in the Gl phase and the Dl gene 
appears to be regulated by colony-stimulating factor 1 (Matsushime et al. 1991 ). Expression 
of Dl cyclin is highly growth factor-dependent and therefore may not be an integral part of 
the internal cell cycle control mechanism, but may occur only in response to external 
signalling (Scherr 1993). The PRAD1 gene, found overexpressed in some parathyroid 
adenomas is identical to the Dl gene (Motokura et al. 1991). Dl has also been found to be 
over-expressed in a glioblastoma cell line (Xiong et al. 1991) and is subject to deregulation 
by gene amplification (Lammie et al. 1 99 1 ; Keyomarsi and Pardee 1 993). Deregulation of D 1 
occurs by unknown mechanisms in some lymphomas, squamous cell tumors and breast 
carcinomas (Bianchi et al. 1993). This protein is involved in activating the growth of cells 
and therefore, deregulated expression of this gene appears to be an oncogenic event. Some 
evidence exists that the E-type cyclin may function in the Gl to S transition in human cells: 
this cyclin binds to and activates p34cdc2 protein in extracts of human lymphoid cells in Gl, 
the protein is associated with histone HI kinase activity in HeLa cells (Koff et al. 1991) and 
cyclin E mRNA is specifically expressed in late G I in HeLa cells (Lew et al. 1 99 1 ). 

It has been hypothesized that p34 cdc2 acts at discrete transition points in the cell 
cycle by phosphorylating varying substrates. The phosphorylating activity is manifest upon 
association of the kinase with cyclins which are differentially expressed throughout the cycle 
of the cell. These different cyclins may alter the substrate specificity of the kinase or may 
alter its catalytic efficiency (Pines and Hunter 1990). Obvious potential substrates for the 
cdc2 kinase are transcription factors that control cell-cycle-stage-specific gene transcription. 

Disruption of any one of the three CLN genes in yeast does not appreciably affect cell 
growth, however, upon disruption of all of the CLN genes, cells arrest in Gl. In addition, in 
response to mating pheromone, the CLN proteins are inhibited and yeast cell growth is 
arrested. Two genes whose products inhibit cyclin activity have been identified in 5. 
cererisiae. The products of the FAR1 and FUS3 genes inhibit CLN2 and CLN3 function, 
respectively. With pheromone signalling, the levels of Farlp and Fus3p increase, the Gl 
cyclins do not accumulate, the CDC28p kinase remains inactive, and cell growth is arrested in 
Gl. These observations suggest that inhibitors of the cyclin proteins, inhibitors of a 
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productive association between the cyclins and the kinase, or inactivators of the kinase can 
foster cellular growth arrest. 

By contrast, cyclins which are uninhibitable appear to function as uncontrolled 
positive growth regulators. High level expression of the CLN proteins is a lethal condition in 
yeast cells. Data indicate that the loss of controlled expression of cyclin Dl through 
chromosomal breakage, chromosomal translocation or gene amplification can promote 
oncogenicity in mammalian cells (Xiong et al. 1991; Lammie et al. 1991; Bianchi et al. 
1993). In addition, it appears that events that disrupt the control of cyclin expression and 
control of cyclin function can result in bypass of the Gl checkpoint and dysregulated cellular 
growth. The cyclin proteins which operate in Gl to promote cellular proliferation would be 
ideal targets for therapeutics aimed at control of cell growth. Candidate surrogate proteins for 
substitution in the yeast pathway for identification of such therapeutics include human cyclins 
C, D and E. All three proteins are normally expressed during the Gl phase of the mammalian 
cell cycle and are thus candidate mediators of the commitment of cells to proliferate. 

Examples of compounds which are known to act in Gl to prevent entry into S phase 
are transforming growth factor p (TGF-p) and rapamycin. Rapamycin, an 
immunosuppressant, inhibits the activity of cyclin E-bound kinase. This macrolide acts in Gl 
to prevent the proliferation of IL-2-stimulated T lymphocytes (Scherr 1993). TGF-p has been 
shown to prevent progression from Gl to S phase in mink lung epithelial cells (Howe et al. 
1991). TGF-P appears to interfere with activation of the kinase, perhaps by reducing the 
stability of the complex which the kinase forms with cyclin E (Koff 1993 ). 

A strain of yeast cells bearing inactive CLN1, CLN2 and CLN3 genes and an 
integrated chimeric gene encoding a Gall promoter-driven human CLN sequence (see DL1 
cells, Lew et al. Cell 66, 1 1 97 ( 1 99 1 ) will serve as a tester strain. The Gal 1 promoter permits 
high level expression when cells are grown in the presence of galactose but this promoter is 
repressed when cells are grown on glucose. Yeast cells so engineered are nonviable on 
glucose due to an absence of expression of functional cyclin. These yeast, however, 
proliferate on galactose-containing medium due to expression of the human cyclin sequence. 
Exposure of this strain to an inhibitor of cyclin function would render the cells incapable of 
growth, even on galactose medium, i.e., the cells would growth arrest in the presence or 
absence of galactose. This phenoty pe could serve as an indication of the presence of an 
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exogenously applied cyclin inhibitor but would not be useful as a screen for the identification 
of candidate inhibitors from members of a random peptide library. Growth arrest of a subset 
of cells in an otherwise growing population is useless as an indicator system. Therefore, in 
order to identify random peptide inhibitors of mammalian cyclins, a two stage screen is 
envisioned. 

A two-hybrid system described by Fields and Song (Nature 340, 245 (1989) permits 
the detection of protein-protein interactions in yeast. GAL4 protein is a potent activator of 
transcription in yeast grown on galactose. The ability of GAL4 to activate transcription 
depends on the presence of an N-terminal sequence capable of binding to a specific DNA 
sequence (UAS G) and a C-terminal domain containing a transcriptional activator. A sequence 
encoding a protein, "A", may be fused to that encoding the DNA binding domain of the 
GAL4 protein. A second hybrid protein may be created by fusing sequence encoding the 
GAL4 transactivation domain to sequence encoding a protein "B". If protein "A" and protein 
"B" interact, that interaction serves to bring together the two domains of GAL4 necessary to 
activate transcription of a UAS G -containing gene. In addition to co-expressing plasmids 
encoding both hybrid proteins, yeast strains appropriate for the detection of protein-protein 
interactions using this two-hybrid system would contain a GALl-lacZ fusion to permit 
detection of transcription from a UAS G sequence. These strains should also be deleted for 
endogenous GAL4 and for GAL80, a negative regulator of GAL4. 
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In a variation of the two-hybrid system just described, the GAL4 DNA binding 
domain would be fused to a human cyclin sequence. In addition, oligonucleotides encoding 
random peptides would be ligated to sequence encoding the GAL4 transactivation domain. 
Co-transformation of appropriate yeast strains w ith plasm ids encoding these two hybrid 
proteins and screening for yeast expressing P-galactosidase would permit identification of 
yeast expressing a random peptide sequence capable of interacting with a human cyclin. 
Identification of peptides with that capability would be the goal of this first stage of 
screening. 

ii. Once random peptides capable of interacting with a human cyclin of interest had been 
identified, second stage screening could commence. The second screen would permit the 
identi-fication of peptides that not only bound to human cyclin but, through that interaction, 
inhibited cyclin activation of the cell cycle-dependent kinase and, thus, cellular proliferation. 
Thus, candidate peptides would be expressed, individually, in yeast lacking CLN1, CLN2 and 
CLN3 but expressing a human CLN sequence, as described above. Those peptides, 
expression of which does not permit growth of the tester strain on galactose, would be 
presumed cyclin inhibitors. 

An advantage to this two-stage approach to the identiflca-tion of potential cyclin 
inhibitors is the high probability that random peptide sequences selected in stage one interact 
with human cyclin proteins. A subsequently determined ability of that sequence to cause 
growth arrest of the tester yeast on galactose would be a strong indication that the growth 
arrest was due to a direct effect of the peptide on the cyclin and not on another protein, e.g., 
the cell cycle dependent kinase. Though a strong indication, such a result would not be an 
absolute indication and verification of the inhibitory effect on cyclin function could be 
obtained in vitro through biochemical assay. 

Screening and Selection 

A marker gene is a gene whose expression causes a phenotypic change which is 
screenable or selectable. If the change is selectable, the phenotypic change creates a 
difference in the growth or survival rate between cells which express the marker gene and 
those which do not. If the change is screenable, the phenotype change creates a difference in 
some detectable characteristic of the cells, by which the cells which express the marker may- 
be distinguished from those which do not. Selection is preferable to screening. 
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The marker gene may be coupled to the yeast pheromone pathway so that expression 
of the marker gene is dependent on activation of the G protein. This coupling may be 
achieved by operably linking the marker gene to a pheromone-responsive promoter. The term 
"pheromone-responsive promoter" indicates a promoter which is regulated by some product 
of the yeast pheromone signal transduction pathway, not necessarily pheromone per se . In 
one embodiment, the promoter is activated by the pheromone pathway, in which case, for 
selection, the expression of the marker gene should result in a benefit to the cell. A preferred 
marker gene is the imidazoleglycerol phosphate dehydratase gene (HIS3). If a pheromone 
responsive promoter is operably linked to a beneficial gene, the cells will be useful in 
screening or selecting for agonists. If it is linked to a deleterious gene, the cells will be useful 
in screening or selecting for antagonists. 

Alternatively, the promoter may be one which is repressed by the pheromone pathway, 
thereby preventing expression of a product which is deleterious to the cell. With a 
pheromone-repressed promoter, one screens for agonists by linking the promoter to a 
deleterious gene, and for antagonists, by linking it to a beneficial gene. 

Repression may be achieved by operably linking a pheromone-induced promoter to a 
gene encoding mRNA which is antisense to at least a portion of the mRNA encoded by the 
marker gene (whether in the coding or flanking regions), so as to inhibit translation of that 
mRNA. Repression may also be obtained by linking a pheromone-induced promoter to a 
gene encoding a DNA-binding repressor protein, and incorporating a suitable operator site 
into the promoter or other suitable region of the marker gene. 
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Suitable positively selectable (beneficial) genes include the following: L'R.43. LYS2 y 
HIS3, LEI 2. TRP1; ADEJ .23.4,5.". 8, ARG 1 ,3.4.5,6,8; HISl.4,5; ILl'1.2.5; THRL4: 
TRP2.3.4J; LEL'1.4; AfET2, 3.4,8.9, I4.16J9; CR4 1 .2,4. 5. 1 0; HO\f3,6; ASP3; CHOI; ARC 
2,~; CYS3; OLEl; ISOl,2.4: PROl.3 Countless other genes are potential selective markers. 
The above are involved in well-characterized biosynthetic pathways. 

The imidazoleglycerol phosphate dehydratase (1GP dehydratase) gene ( HIS3 ) is 
preferred because it is both quite sensitive and can be selected over a broad range of 
expression levels. In the simplest case, the cell is auxotrophic for histidine (requires histidine 
for growth) in the absence of activation. Activation leads to synthesis of the enzyme and the 
cell becomes prototrophic for histidine (does not require histidine). Thus the selection is for 
growth in the absence of histidine. Since only a few molecules per cell of IGP dehydratase 
are required for histidine prototrophy, the assay is very sensitive. 

In a more complex version of the assay, cells can be selected for resistance to 
aminotriazole (AT), a drug that inhibits the activity of IGP dehydratase. Cells with low, fixed 
level of expression of HIS3 are sensitive to the drug, while cells with higher levels are 
resistant. The amount of AT can be selected to inhibit cells with a basal level of HIS3 
expression (whatever that level is) but allow growth of cells with an induced level of 
expression. In this case selection is for growth in the absence of histidine and in the presence 
of a suitable level of AT. 

In appropriate assays, so-called counterselectable or negatively selectable genes may 
be used. Suitable genes include: URA3 (orotidine-5-phosphate decarboxylase; inhibits 
growth on 5-fluoroorotic acid), LYS2 (2-aminoadipate reductase; inhibits growth on a- 
aminoadipate as sole nitrogen source), CYH2 (encodes ribosomal protein L29; 
cycloheximide-sensitive allele is dominant to resistant allele), CAN1 (encodes arginine 
permease; null allele confers resistance to the arginine analog canavanine), and other 
recessive drug-resistant markers. 



The natural response to induction of the yeast pheromone response pathway is for 
cells to undergo growth arrest. This is the preferred way to select for antagonists to a 
ligand receptor pair that induces the pathway. An autocrine peptide antagonist would inhibit 
the activation of the pathway; hence, the cell would be able to grow. Thus, the FAR 1 gene 
may be considered an endogenous counterselectable marker. The FAR1 gene is preferably 
inactivated when screening for agonist activity. 

The marker gene may also be a screenable gene. The screened characteristic may be a 
change in cell morphology, metabolism or other screenable features. Suitable markers 
include beta-galactosidase (Xgal, C i: FDG, Salmon-gal, Magenta-Gal (latter two from 
Biosynth Ag)), alkaline phosphatase, horseradish peroxidase, exo-glucanase (product of yeast 
exbl gene; nonessential, secreted); luciferase; and chloramphenicol transferase. Some of the 
above can be engineered so that they are secreted (although not p-galactosidase). The 
preferred screenable marker gene is beta-galactosidase; yeast cells expressing the enzyme 
convert the colorless substrate Xgal into a blue pigment. Again, the promoter may be 
pheromone-induced or pheromone-inhibited. 

Yeast Cells 

The yeast may be of any species that possess a G protein-mediated signal transduction 
pathway and which are cultivatable. Suitable species include Kluyverei lactis , 
Schizosaccharomyces pombe , and Ustilago maydis ; Saccharomyces cerevisiae is preferred. 
Either Got or Gpy may be the activator of the "effectcr." (It is suspected that in some species, 
both Ga-activated and Gpy-activated effectors exist.) The term "yeast", as used herein, 
includes not only yeast in a strictly taxonomic sense (i.e., unicellular organisms), but also 
yeast-like multicellular fungi with pheromone responses mediated by the mating pathway. 

The yeast cells of the present invention may be used to test peptides for the ability to 
interact with an exogenous G protein-coupled receptor or other PSP surrogate. The yeast 
cells must express both the exogenous G protein-coupled receptor (or other PSP surrogate), 
and a complementary G protein (or other PSPs necessary for the PSP surrogate to function in 
the pheromone system, if need be after activation by a drug), and these molecules must be 
presented in such a manner that a "readout" can be obtained by means of the pheromone 
response pathway (which may be modified to improve the readout). 



For a readout to be possible, a gene encoding a selectable or screenable trait must be 
coupled to the G protein-mediated signal transduction pathway so that the level of expression 
of the gene is sensitive to the presence or absence of a signal, i.e., binding to the coupled 
exogenous receptor. This gene may be an unmodified gene already in the pathway, such as 
the genes responsible for growth arrest. It may be a yeast gene, not normally a part of the 
pathway, that has been operably linked to a "pheromone-responsive" promoter. Or it may be 
a heterologous gene that has been so linked. Suitable genes and promoters were discussed 
above. 

It will be understood that to achieve selection or screening, the yeast must have an 
appropriate phenotype. For example, introducing a pheromone-responsive chimeric HIS3 
gene into a yeast that has a wild-type HIS3 gene would frustrate genetic selection. Thus, to 
achieve nutritional selection, an auxotrophic strain is wanted. 

The yeast cells of the present invention optionally possess one or more of the 
following characteristics: 

(a) the endogenous FAR1 gene has been inactivated; 

(b) the endogenous SST2 gene, and/or other genes involved jn desensitization, has 
been inactivated; 

(c) the endogenous pheromone (a- or a-factor) receptor genejias been inactivated; 

and 

(d) the endogenous pheromone genes have been inactivated. 

"Inactivation" means that production of a functional gene product is prevented or 
inhibited. Inactivation may be achieved by deletion of the gene, mutation of the promoter so 
that expression does not occur, or mutation of the coding sequence so that the gene product is 
inactive. Inactivation may be partial or total. 
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Mutants with inactivated supersensitivity-related genes can be identified by 
conventional genetic screening procedures. The farl gene was identified as an a-factor 
resistant mutant that remained blue (with fusl-lacZ) on a-factor Xgal. far2, as it turns out, is 
the same as fus3. Supersensitive mutants could be identified as constitutive weak blue 
colonies expressing fusl-lacZ on Xgal, or as strains that can mate more proficiently with a 
poor pheromone-secreter. 

The DNA sequences of (a) the a- and a-factor genes, ( b) the a- and a-factor receptors, 
(c) the FAR1 gene, (d) the SST2 gene, and (e) the FUS1 promoter have been reported in the 
following references: 

MFal and MFa2 : AJ Brake, C Brenner. R Najarian, P Laybourn, and J Merryweather. 
Structure of Genes Encoding Precursors of the Yeast Peptide Mating Pheromone a-Factor. In 
Protein Transport and Secretion. Gething M-J, ed. Cold Spring Harbor Lab, New York, 1985. 

MFal and MFa2 : Singh, A. EY Chen, JM Lugovoy, CN Chang, RA Hitzeman et al. 1983. 
Saccharomyces cerevisiae contains two discrete genes coding for the a-pheromone. Nucleic 
Acids Res. 1 1:4049; J Kurjan and I Herskowitz. 1982. Structure of a yeast pheromone gene 
(MF): A putative a-factor precursor contains four tandem copies of mature a-factor. Cell 

30:933. 

STE2 and STE3 : AC Burkholder and LH Hartwell. 1985. The yeast a-factor receptor: 
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69 

FAR 1 : F Chang and I Herskowitz. 1990. Identification of a gene necessary for cell cycle 
arrest by a negative growth factor of yeast: FAR1 is an inhibitor of a G 1 cyclin, CLN2. Cell 
63:999. 
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Saccharomyces cerevisiae SST2 gene: A model for desensitization to pheromone. Mol Cell 
Biol 7: 4169. 
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The various essential and optional features may be imparted to yeast cells by, e.g., one 
or more of the following means: isolation of spontaneous mutants with one or more of the 
desired features; mutation of yeast by chemical or radiation treatment, followed by selection; 
and genetic engineering of yeast cells to introduce, modify' or delete genes. 

Other explicit characteristics desirable in strains of yeast designed to be used as 
screening devices for inhibitors activators of PSP surrogates are discussed in subsections 
dealing specifically with each molecular target. 

Peptide 

The term "peptide" is used herein to refer to a chain of two or more amino acids, with 
adjacent amino acids joined by peptide (-NHCO-) bonds. Thus, the peptides of the present 
invention include oligopeptides, polypeptides, and proteins. Preferably, the peptides of the 
present invention are 2 to 200, more preferably 5 to 50, amino acids in length. The minimum 
peptide length is chiefly dictated by the need to obtain sufficient potency as an activator or 
inhibitor. The maximum peptide length is only a function of synthetic convenience once an 
active peptide is identified. 

For initial studies in which the cognate PSP was a yeast pheromone receptor, a 13- 
amino acid peptide was especialh preferred as that is the length of the mature yeast a-factor. 



Peptide Libraries 
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A "peptide library" is a collection of peptides of many different sequences (typically 
more than 1000 different sequences), which are prepared essentially simultaneously, in such a 
way that, if tested simultaneously for some activity, it is possible to characterize the 
"positive" peptides. 

The peptide library of the present invention takes the form of a yeast cell culture, in 
w hich essentially each cell expresses one. and usually only one, peptide of the library - While 
the diversity of the library is maximized if each cell produces a peptide of a different 
sequence, it is usually prudent to construct the library so there is some redundancy. 

In the present invention, the peptides of the library are encoded by a mixture of DNA 
molecules of different sequence. Each peptide-encoding DNA molecule is ligated with a 
vector DNA molecule and the resulting recombinant DNA molecule is introduced into a yeast 
cell. Since it is a matter of chance which peptide-encoding DNA molecule is introduced into 
a particular cell, it is not predictable which peptide that cell will produce. However, based on 
a knowledge of the manner in which the mixture was prepared, one may make certain 
statistical predictions about the mixture of peptides in the peptide library. 

It is convenient to speak of the peptides of the library as being composed of constant 
and variable residues. If the nth residue is the same for all peptides of the library , it is said to 
be constant. If the nth residue varies, depending on the peptide in question, the residue is a 
variable one. The peptides of the library will have at least one, and usually more than one, 
variable residue. A variable residue may vary among any of two to all twenty of the 
genetically encoded amino acids; the variable residues of the peptide may vary in the same or 
dif-ferent manner. Moreover, the frequency of occurrence of the allowed amino acids at a 
particular residue position may be the same or different. The peptide may also have one or 
more constant residues. 

There are two principal ways in which to prepare the required DNA mixture. In one 
method, the DNAs are synthesized a base at a time. When variation is desired, at a base 
position dictated by the Genetic Code, a suitable mixture of nucleotides is reacted with the 
nascent DNA, rather than the pure nucleotide reagent of conventional polynucleotide 
synthesis. 

The second method provides more exact control over the amino acid variation. First, 
trinucleotide reagents are prepared, each trinucleotide being a codon of one (and only one) of 
the amino acids to be featured in the peptide library. When a particular variable residue is to 
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be synthesized, a mixture is made of the appropriate trinucleotides and reacted with the 
nascent DNA. 

Once the necessary "degenerate" DNA is complete, it must be joined with the DNA 
sequences necessary to assure the expression of the peptide, as discussed in more detail 
below, and the complete DNA construct must be introduced into the yeast cell. 

Expression System 

The expression of a peptide-encoding gene in a yeast cell requires a promoter which is 
functional in yeast. Suitable promoters include the promoters for metallothionein, 3- 
phosphoglycerate kinase (Hitzeman et al., J. Biol. Chem. 255, 2073 (1980) or other glycolytic 
enzymes (Hess et al., J. Ady. Enzyme Reg. 7, 149 (1968); and Holland et al. Biochemistry 1 7 , 
4900 (1978)), such as enolase, glyceraldehyde-3-phosphate dehydrogenase, hexokinase, 
pyruvate decarboxylase, phospho-fructokinase, glucose-6-phosphate isomerase, 3- 
phosphoglycerate mutase, pyruvate kinase, triosephosphate isomerase, phospho-glucose 
isomerase, and glucokinase. Suitable vectors and promoters for use in yeast expression are 
further described in R. Hitzeman et al., EPO Publn. No. 73,657. Other promoters, which 
have the additional advantage of transcription controlled by growth conditions, are the 
promoter regions for alcohol dehydrogenase 2, isocytochrome C, acid phosphatase, 
degradative enzymes associated with nitrogen metabolism, and the afore-mentioned 
metallothionein and gIyceraldehyde-3-phosphate dehydrogenase, as well as enzymes 
responsible for maltose and galactose utilization. Finally, promoters that are active in only 
one of the two haploid mating types may be appropriate in certain circumstances. Among 
these haploid-specific promoters, the pheromone promoters MFal and MFal are of particular 
interest. 

In screens devised for a subset of PSP surrogates (e.g. kinases, cyclins) random 
peptide sequences need not be expressed in the context of yeast pheromone and need not be 
engineered for secretion or transport to the extracellular space. Libraries of random peptides 
may be expressed in a multiplicity of ways, including as portions of chimeric proteins, as in a 
two-hybrid protein system designed to signal protein-protein interactions. Random peptides 
need not necessarily substitute for yeast pheromones but can impinge on the pheromone 
pathway downstream of the interaction between pheromone and pheromone receptor (as in 
random peptide inhibitors of the kinases or of the cyclins). 
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In constructing suitable expression plasmids, the termination sequences associated 
with these genes, or with other genes which are efficiently expressed in yeast, may also be 
ligated into the expression vector 3' of the heterologous coding sequences to provide 
polyadenyiation and termination of the mRNA. 

Vectors 

The vector must be capable of replication in a yeast cell. It may be a DNA which is 
integrated into the host genome, and thereafter is replicated as a part of the chromosomal 
DNA, or it may be DNA which replicates autonomously, as in the case of a plasmid. In the 
latter case, the vector must include an origin of replication which is functional in the host. In 
the case of an integrating vector, the vector may include sequences which facilitate 
integration, e.g., sequences homologous to host sequences, or encoding integrases. 

Besides being capable of replication in yeast cells, it is convenient if the vector can 
also be replicated in bacterial cells, as many genetic manipulations are more conveniently 
carried out therein. Shuttle vectors capable of replication in both yeast and bacterial cells 
include YEps, Yips, and the pRS series. 

Periplasmic Secretion 

The cytoplasm of the yeast cell is bounded by a lipid bilayer called the plasma 
membrane. Between this plasma membrane and the cell wall is the periplasmic space. 
Peptides secreted by yeast cells cross the plasma membrane through a variety of mechanisms 
and thereby enter the periplasmic space. The secreted peptides are then free to interact with 
other molecules that are present in the periplasm or displayed on the outer surface of the 
plasma membrane. The peptides then either undergo re-uptake into the cell, diffuse through 
the cell wall into the medium, or become degraded within the periplasmic space. 

The peptide library may be secreted into the periplasm by one of two distinct 
mechanisms, depending on the nature of the expression system to which they are linked. In 
one system, the peptide may be structurally linked to a yeast signal sequence, such as that 
present in the a-factor precursor, which directs secretion through the endoplasmic reticulum 
and Golgi apparatus. Since this is the same route that the receptor protein follows in its 
journey to the plasma membrane, opportunity exists in cells expressing both the receptor and 
the peptide library for a specific peptide to interact with the receptor during transit through 
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the secretory pathway. This has been postulated to occur in mammalian cells exhibiting 
autocrine activation. Such interaction would likely yield activation of the linked pheromone 
response pathway during transit, which would still allow identification of those cells 
expressing a peptide agonist. For situations in which peptide antagonists to externally applied 
receptor agonist are sought, this system would still be effective, since both the peptide 
antagonist and receptor would be delivered to the outside of the cell in concert. Thus, those 
cells producing an antagonist would be selectable, since the peptide antagonist would be 
properly and timely situated to prevent the receptor from being stimulated by the externally 
applied agonist. 

An alternative mechanism for delivering peptides to the periplasmic space is to use 
the ATP-dependent transporters of the STE6/N1DR1 class. This transport pathway and the 
signals that direct a protein or peptide to this pathway are not as well characterized as is the 
endoplasmic reticulum-based secretory pathway. Nonetheless, these transporters apparently 
can efficiently export certain peptides directly across the plasma membrane, without the 
peptides having to transit the ER/Golgi pathway. We anticipate that at least a subset of 
peptides can be secreted through this pathway by expressing the library in context of the a- 
factor prosequence and terminal tetrapeptide. The possible advantage of this system is that 
the receptor and peptide do not come into contact until both are delivered to the external 
surface of the cell. Thus, this system strictly mimics the situation of an agonist or antagonist 
that is normally delivered from outside the cell. Use of either of the described pathways is 
within the scope of the invention. 

The present invention does not require periplasmic secretion, or. if such secretion is 
provided, any particular secretion signal or transport pathway. 
Example 1: Development of Autocrine Yeast Strains 

In this example, we describe a pilot experiment in which haploid cells were engineered to be 
responsive to their own pheromones (see Figure 1). (Note that in the examples, functional 
genes are capitalized and inactivated genes are in lower case.) For this purpose we 
constructed recombinant DNA molecules designed to: 

i. place the coding region of STE2 under the transcriptional control of elements which 
normally direct the transcription of STE3. This is done in a plasmid that allows the 
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replacement of genomic STE3 of S.cerevisiae with sequences wherein the coding sequence of 
STE2 is driven b> STE3 transcriptional control elements. 

it. place the coding region of STE3 under the transcriptional control of elements which 
normally direct the transcription of STE2. This is done in a plasmid which will allow the 
replacement of genomic STE2 of S.cerevisiae with sequences wherein the coding sequence of 
STE3 is driven by STE2 transcriptional control elements. 

The sequence of the STE2 gene is known see Burkholder A.C. and Hartwell L.H. 
(1985), "The yeast a-factor receptor: Structural properties deduced from the sequence of the 
STE2 gene," Nuc. Acids Res. 13, 8463; Nakayama N., Miyajima A., Arai K. (1985) 
"Nucleotide sequences of STE2 and STE3, cell type- 
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specific sterile genes from Saccharomyces cerevisiae," EMBO J. 4. 2643. 

A 4.3 kb BamHI fragment that contains the entire STE2 gene was excised from 
plasmid YEp24-STE2 (obtained from J. Thorner, Univ. of California) and cloned into 
pALTER (Protocols and Applications Guide. 1991, Promega Corporation. Madison, WI). An 
Spel site was introduced 7 nucleotides (nts) upstream of the ATG of STE2 w ith the follow ing 
mutagenic oligonucleotide, using the STE2 minus strand as template (mutated bases are 
underlined and the start codon is in bold type): 

5' GTTAAGAACCATATACTAGTATCAAAAATGTCTG 3' (SEQ ID NO: 14). 

A second Spel site was simultaneously introduced just downstream of the STE2 stop 
codon with the following mutagenic oligonucleotide (mutated bases are underlined and the 
stop codon is in bold type): 

5' TGATCAAAATTTAC TAG TTTGAAAAAGTAATTTCG 3' (SEQ ID NO: 15). 

The BamHI fragment of the resulting plasmid (Cadus 1096), containing STE2 with 
Spel sites immediately flanking the coding region, was then subcloned into the yeast 
integrating vector Yip 19 to yield Cadus 1 143. 

The STE3 sequence is also known. Nakayama N., Miyajima A., Arai K. (1985). 
"Nucleotide sequences of STE2 and STE3, cell type-specific sterile genes from 
Saccharomyces cerevisiae " EMBO J. 4, 2643; Hagen D.C., McCaffrey G., Sprague G.F. 
(1986), 'Evidence the yeast STE3 gene encodes a receptor for the peptide pheromone a- 
factor: gene sequence and implications for the structure of the presumed receptor," Proc. Natl. 
Acad. Sci. 83, 1418. STE3 was made available by Dr. J. Broach as a 3.1 kb fragment cloned 
into pBLUESCRIPT-KS II (Stratagene, 11011 North Torrey Pines Road, La Jolla, CA 
92037). STE3 was subcloned as a KpnI-Xbal fragment into both M13mpl8 RF (to yield 
Cadus 1105) and pUC19 (to yield Cadus 1107). The two Spel sites in Cadus 1107 were 
removed by digestion with Spel, fill-in with DNA polymerase I Klenow fragment, and 
recircularization by blunt-end ligation. Single-stranded DNA containing the minus strand of 
STE 3 was obtained using Cadus 1 105 and Spel sites were introduced 9 nts upstream of the 
start codon and 3 nts downstream of the stop codon of STE 3 with the following mutagenic 
oligonucleotides, respectively: 
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5' GGCAAAATACTAGTAAAATTTTCATGTC 3' (SEQ ID NO: 16). 

5' GGCCCTTAACACACTAGTGTCGCATTATATTTAC 3' (SEQ ID NO: 17). 

The mutagenesis was accomplished using the T7-GEN protocol of United States 
Biochemical (T7-GEN In Vitro Mutagenesis Kit, Descriptions and Protocols, 1991, United 
States Biochemical, P.O. Box 22400, Cleveland, Ohio 44122). The replicative form of the 
resulting Cadus 1141 was digested with Aflll and Kpnl, and the approximately 2 kb fragment 
containing the entire coding region of STE3 flanked by the two newly introduced Spe I sites 
was isolated and ligated with the approximately 3.7 kb vector fragment of Aflll- and Kpnl- 
digested Cadus 1107, to yield Cadus 1138. Cadus 1138 was then digested with Xbal and 
Kpnl, and the STEi-containing 2.8 kb fragment was ligated into the Xbal- and KpnI-digested 
yeast integrating plasmid pRS406 (Sikorski, R.S. and Hieter, P. (1989) "A System of Shuttle 
Vectors and Yeast Host Strains Designed for Efficient Manipulation of DNA in 
Saccharomyces cerevisiae", Genetics 122 :19-27 to yield Cadus 1 145. 

The Spel fragment of Cadus 1 143 was replaced with the Spel fragment of Cadus 1 145 
to yield Cadus 1147, in which the coding sequences of STE3 are under the control of STE2 
expression elements. Similarly, the Spel fragment of Cadus 1 145 was replaced with the Spel 
fragment of Cadus 1143 to yield Cadus 1 148, in which the coding sequences of STE2 are 
under the control of STE3 expression elements. Using the method of pop-in/pop-out 
replacement (Rothstein, R. (1991) "[19] Targeting, Disruption, Replacement, and Allele 
Rescue: Integrative DNA Transformation in Yeast", Methods in Enzvmology , 194 :28 1-301 ), 
Cadus \ 147 was used to replace genomic STE2 with the ste2-STE3 hybrid in a MATa cell 
and Cadus 1148 was used to replace genomic STE3 with the ste3-STE2 hybrid in a MATa 
cell. Cadus 1 147 and 1 148 contain the selectable marker URA3. 
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Haploid yeast of mating type a which had been engineered to express HIS3 under the 
control of the pheromone-inducible FUS1 promoter were transformed with CADUS 1147. 
and transformants expressing URA3 were selected. These transform ants, which express both 
Ste2p and Ste3p, were plated on 5-fluoroorotic acid to allow the selection of clones \vhich 
had lost the endogenous STE2* leaving in its place the heterologous, integrated STES. Such 
cells exhibited the ability to grow on media deficient in histidine, indicating autocrine 
stimulation of the pheromone response pathw ay. 

Similarly, haploids of mating type a that can express HIS3 under the control of the 
pheromone-inducible FUS1 promoter were transformed with CADUS 1 148 and selected for 
replacement of their endogenous STE3 with the integrated STE2. Such cells showed, by their 
ability to grow on histidine-deficient media, autocrine stimulation of the pheromone response 
pathway. 

Example 2: Strain Development 

In this example, yeast strains are constructed which will facilitate selection of clones 
which exhibit autocrine activation of the pheromone response pathway. To construct 
appropriate yeast strains, we will use: the Ylp-STE3 and pRS-STE2 knockout plasmids 
described above, plasmids available for the knockout of FAR 1, SST2. and HIS3, and mutant 
strains that are commonly available in the research community. The following haploid strains 
will be constructed, using one-step or two-step knockout protocols described in Meth. 
EnzymoJ 194:281-301. 1991: 

1. MA 7 a ste 3 : ; S TE2 : ste 3 far I sst2 FUS1::HIS3 

2. MATa ste2 .STE3 :ste2 far! sst2 Fl'Sl::HIS3 

3. MATa ste3::STE2:ste3 far I sst2 mfal mfcQ Fi'Sl ::WS3 

4. MATa ste2::STE3::ste2 far! sst2 mfal mfa2 FL'Sl :!11S3 

5. MATa bar! farl-1 fusl-H!S3 ste!4::TRPl ura3 trpl leu2 his3 

6. MATa mfal mfal farl-1 his3: fits 1-H1S3 sie2-STE3 ura3 met! adel leu2 





78 

Strains 1 and 2 will be tested for their ability to grow on histidine-deficient media as a 
result of autocrine stimulation of their pheromone response pathways by the pheromones 
which they secrete. If these tests prove successful, strain 1 will be modified to inactivate 
endogenous MFal and MFa2 The resulting strain 3, MAT a far I sst2 sre3: :STE2: sre3 
Fi'SI H1S3 mfal mfa2, should no longer display the selectable phenotype (i.e.. the strain 
should be auxotrophic for histidine). Similarly, strain 2 will be modified to inactivate 
endogenous MFal and MFa2. The resulting strain 4. MAT a far I sst2 ste2: STF3: :ste2 
Fi 'SI: H1S3 mfal mfa2, should be auxotrophic for histidine. The uses of strains 5 and 6 are 
outlined in Examples 3 and 4 below . 

Example 3: Peptide Library 

In this example, a synthetic oligonucleotide encoding a peptide is expressed so that 
the peptide is secreted or transported into the periplasm. 

i. The region of MFal which encodes mature a-factor has been replaced via single-stranded 
mutagenesis with restriction sites that can accept oligonucleotides with A fill and Bglll ends. 
Insertion of oligonucleotides with Aflll and Bglll ends will yield plasmids which encode 
proteins containing the MFal signal and leader sequences upstream of the sequence encoded 
by the oligonucleotides. The MFal signal and leader sequences should direct the processing 
of these precursor proteins through the pathway normally used for the transport of mature a- 
factor. 

The MFal gene, obtained as a 1. 8 kb EcoRI fragment from pDA6300 (J. Thorner, 
Univ. of California) was cloned into pALTER (see Figure 2) in preparation for 
oligonucleotide-directed mutagenesis to remove the coding region of mature a-factor while 
constructing sites for acceptance of oligonucleotides with Aflll and Bell ends. The 
mutagenesis was accomplished using the minus strand as template and the following 
mutagenic oligonucleotide: 
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5 ' C T A A A G A A G A AGGGGTATCT TTGCTTA AGC TCGAGATCTC GACTGATAAC 
AACAGTGTAG 3' (SEQ ID NO:18). 

A Hindlll site was simultaneously introduced 7 nts upstream of the MFal start codon 
with the oligonucleotide: 

5'C AT AC AC A AT ATA A AGCTTT A A A A G A A TG A G 3' (SEQ ID NO: 19). 

The resulting plasmid. Cadus 1214, contains a Hindlll site 7 nts upstream of the MFa 
1 initiation codon, an A fill site at the positions which encode the KEX2 processing site in the 
MFal leader peptide, and Xhol and Bglll sites in place of all sequences from the leader- 
encoding sequences up to and including the normal stop codon. The 1 .5 kb Hindlll fragment 
of Cadus 1214 therefore provides a cloning site for oligonucleotides to be expressed in yeast 
and secreted through the pathway normally travelled by endogenous a-factor. 

A sequence comprising the ADlll promoter and 5' flanking sequence was obtained as 
a 1.5 kb BamHI-Hindlll fragment from pAAH5 (Ammerer, G. (1983) "[11] Expression of 
Genes in Yeast Using the ADCI Promoter". Academic Press, Inc.. Meth. Enzymol. 101. 192- 
201 and ligated into the high copy yeast plasmid pRS426 (Christianson. T.W et al. (1992) 
"Multifunctional yeast high-copy-number shuttle vectors". Gene 110 :1 19-122) (see Figure 3). 
The unique Xhol site in the resulting plasmid was eliminated to yield Cadus 1 186. The 1.5 
Kb Hindlll fragment of Cadus 1214 was inserted into Hindlll-digested Cadus 1186; 
expression of sequences cloned into this cassette initiates from the A/Jf/I promoter. The 
resulting plasmid. designated Cadus 1215, can be prepared to accept oligonucleotides with 
Aflll and Bell ends by digestion with those restriction endonucleases. The oligonucleotides 
will be expressed in the context of MFal signal and leader peptides (Figure 4). 

Modified versions of Cadus 1215 were also constructed. To improve the efficiency of 
ligation of oligonucleotides into the expression vector, Cadus 1215 was restricted with Kpnl 
and religated to yield Cadus 1337. This resulted in removal of one of two Hindlll sites. 
Cadus 1337 was linearized with Hindlll. Fitled-in, and recircularized to generate Cadus 1338. 
To further tailor the vector for library construction, the following double- 



stranded oligonucleotide was cloned into Aflll- and Bglll- digested Cadus 1338: 

5' TTA AGCGTGAGGCAGA AGCTTATCG ATA oligo 062(SEQ IDNO:37) 

3' CGCACTCCGTCTTCGA ATAGC TATCTAG oligo 063(SEQ ID NO:3S) 

The Hindlll site is italicized and a Clal site is emboldened; this Clal site is unique in 
the resulting vector. Cadus 1373. In Cadus 1373. the Hindlll site that exists at the junciton 
between the MFct pro sequence and the mature peptide to be expressed by this vector was 
made unique. Therefore the Hindlll site and the downstream Bglll site can be used to insert 
oligo-nucleotides encoding peptides of interest. These modifications of Cadus 1215 prov ide 
an laternative to the use of the Aflll site in the cloning of oligonucleotides into the 
expressions vector. 

Cadus 1373 was altered further to permit elimination from restricted vector 
preparations of contaminating singly-cut plasmid. Such contamination could result in 
unacceptably high background transformation. To eliminate this possibility, approximately 
1.1 kb of dispensable ADH1 sequence at the 5' side of the promoter region was deleted. This 
was accomplished by restruction of Cadus 1373 with SphI and BamHl, fill-in. and ligation; 
this maneuver regenerates the BamHI site. The result-ing vector, Cadus 1624, was then 
restricted with Hindlll and Clal and an approximately 1.4 kb Hindlll and Clal fragment 
encoding lacZ was inserted to generate Cadus 1625. Use of Hindlll- and Bglll-restricted 
Cadus 1625 for acceptance of oligonucleotides results in a low background upon 
transformation of the ligation product into bacteria. 

Two single-stranded oligonucleotide sequences (see below) are synthesized, annealed, 
and repetitively filled in, denatured, and reannealed to form double-stranded oligonucleotides 
that, when digested with Aflll and Bell, can be ligated into the polylinker of the expression 
vector, Cadus 1215. The two single-stranded oligonucleotides have the following sequences: 
5' G CTA CTT A AG CGT GAG GCA GA A GCT 3' (SEQ ID NO:20) and 
5' C GGA TGA TCA <NNN) n AGC TTC TGC CTC ACC. CTT A A G TAG C 3' (SEQ ID 
NOS:21 and 1 18 ) where N is any chosen nucleotide and n is any chosen integer. Yeast 
transformed with the resulting plasmids will secrete — through the ct-factor secretory pathway 



— peptides whose amino acid sequence is determined b> the particular choice of N and n 
( Figure 4 ). 

Alternatively, the following single stranded oligonucleotides are used: 

MFaNNK (76 mer): 

5'CTGClVTCCGA 

NNK 

TGATCAGTCTGTGACGC 3' (SEQ ID NO:39) 
and MFaMbo ( 17 mer): 

5' GCGTCACAG ACTG ATCA 3' (SEQ ID NO:40) 

W hen annealed the double stranded region is: 

TOM / ( 'AGTCTGTGACGC (last 17 bases of SEQ ID NO:39) 

A ( 7 . R TF C A G A C A C T G C G (SEQ ID NO:40 in 3' to 5* direction) 

where the Fokl site is underlined, the Bbsl site is emboldened, and the Mbol site is italicized. 
After fill-in using Taq DNA polymerase (Promega Corporation, Madison. Wisconsin), the 
double stranded product is restricted with Bbsl and Mbol and ligated to Hindlil- and BgHI- 
restricted Cadus 1373. 



ii. The region of M Fa I which encodes mature a-factor will be replaced via single stranded 
mutagenesis with restriction sites that can accept oligonucleotides with Xhol and A fill ends. 
Insertion of oligonucleotides with Xhol and Aflll ends will yield plasmids which encode 
proteins containing the MFal leader sequences upstream of the sequence encoded by the 
oligonucleotides. The MFal leader sequences should direct the processing of these precursor 
proteins through the pathway normally used for the transport of mature a-factor. 
A//\ IF obtained as a BamHI fragment from pKKl (provided by J. Thorner and K. Kuchler), 
was ligated into the BamHI site of pALTER (Promega) (Figure 5). Using the minus strand 
of MFAl as template, a Hindlil site was inserted by oligonucleotide-directed mutagenesis just 
5' to the MFAl start codon using the following oligonucleotide: 

5' CCA AAATAAGTACAAAGCTTTCGAATAGAA ATGCAACCATC (SEQ ID NO:22). 
A second oligonucleotide was used simultaneously to introduce a short polylinker for later 
cloning of synthetic oligonucleotides in place of MFAl sequences. These MFAl sequences 
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encode the C-terminal 5 amino acids of the 21 amino acid leader peptide through to the stop 
codon: 

5'QCCGC TC C A A A A G A A A A G A C C TC G A G C TCG C T T A A G TTCTG C G T A C A 

AAAACGTTGTTC 3' (SEQ ID NO:23). The 1.6 kb Hindlll fragment of the resulting 
plasmid, Cadus I 172. contains sequences encoding the MFAI start codon and the \-terminal 
16 amino acids of the leader peptide, followed by a short polylinker containing Xhol. Sacl. 
and Aflll sites for insertion of oligonucleotides. The 1.6 kb Hindlll fragment of Cadus 1 172 
was ligated into Hindlll-digested Cadus 1186 (see above) to place expression of sequences 
cloned into this cassette under the control of the ADlll promoter. The Sacl site in the 
polylinker was made unique by eliminating a second Sacl site present in the vector. The 
resulting plasmid, designated Cadus 1239. can be prepared to accept oligonucleotides with 
Xhol and Aflll ends by digestion with those restriction endonucleases for expression in the 
context of MFal leader peptides (Figure 6). 

Two single-stranded oligonucleotide sequences (see below ) are synthesized, annealed, and 
repetitively filled in. denatured, and reannealed to form double-stranded oligonucleotides that, 
when digested with Aflll and Bglll, can be cloned into the polylinker of the expression vector, 
Cadus 1239. The two single-stranded oligonucleotides used for the cloning have the 
follow ing sequences: 

5' GG TAC TCG AGT GAA A AG A AG GAC AAC 3' (SEQ IO NO:24) 

5' CG TAC TTA AGC A AT AAC ACA (NNN) n GTT GTC CTT CTT TTC ACT CGA C.TA 
CC y (SEQ ID NOS:25 and 1 19 ) 

where N is any chosen nucleotide and n is any chosen integer. Yeast transformed with the 
resulting plasmids will transport 

— through the pathway normally used for the export of a-factor 

— farnesviated, carboxymethylated peptides whose amino acid sequence is determined by the 
particular choice of N and n (Figure 6). 

Example 4. Peptide Secretion/Transport. 

This example demonstrates the ability to engineer yeast such that they secrete or 
transport oligonucleotide-encoded peptides (in this case their pheromones) through the 
pathways normally used for the secretion or transport of endogenous pheromones. 



Autocrine MATa strain C \ 588: 

A MATa strain designed for the expression of peptides in the context of MFctl (i.e.. 
using the MFctl expression vector. Cadus 1215) has been constructed. The genotype of this 
strain, which we designate CY588, is \fATa burl far 1-1 fusl-lllS3 steN TRPI uraS 
trpl leu2 ///.si. The burl mutation eliminates the strain's ability to produce a protease that 
degrades a-factor and that may degrade some peptides encoded by the cloned 
oligonucleotides; the furl mutation abrogates the arrest of growth which normally follows 
stimulation of the pheromone response pathway: an integrated FLS1-HIS3 hybrid gene 
provides a selectable signal of activation of the pheromone response pathway; and. finally, the 
steN mutation lowers background of the FLS1-HIS3 readout. The enzymes responsible for 
processing of the MFctl precursor in MATa cells are also expressed in MATa cells (Sprague 
and Thorner, in The Molecular and Cellular Biology of the Yeast Saccharomyces: Gene 
Expression . 1992, Cold Spring Harbor Press), therefore. CY588 cells should be able to 
secrete peptides encoded by oligonucleotides expressed from plasmid Cadus 1215. 

A high transforming version (tbt\-\) of CY588 was obtained by crossing CY1013 
(CY588 containing an episomal copy of the STEM gene)(MATa burl : ; hisGfarb 1 fusl-1 1 1S3 
steN TRPI ura3 trpl leu2 his3 /STEN I RA3 CEX4]) to CY793 (MATa tbtl-1 ura3 leu2 
trpl hi.;3 fi<sl-HlS2 canl stel N: TRPI [FUS1 LEL2 2u.]> and selecting from the resultant 
spores a strain possessing the same salient genotype described for CY538 (see above), and in 
addition the tbl-1 allele, which confers the capacity for very high efficiency transformation 
by electroporation. The selected strain is CY1455 (\\ATabarf::hisGfarf-J fusl-lllS3 
steN TRPI tbt-l ura3 trpl Ieu2 his3). 

Secretion of peptides in the context of yeast a-factor: 

Experiments were performed to test: 1. the ability of Cadus 1215 to function as a vector for 
the expression of peptides encoded by synthetic oligonucleotides; 2. the suitability of the 
oligonucleotides, as designed, to direct the secretion of peptides through the a-factor 
secretory pathway: 3. the capacity of CY588 to secrete those peptides; and 4. the ability of 
CY588 to respond to those peptides that stimulate the pheromone response pathway by 
growing on selective media. These experi-ments were performed using an oligonucleotide 
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which encodes the 13 amino acid a-factor: i.e.. the degenerate sequence (NNN> n in the 
oligonucleotide cloned into Cadus 1215 (see above) was specified (n=13) to encode this 
pheromone. CY588 was transformed with the resulting plasmid (Cadus 1219). and 
transformants selected on uracil-deficient medium were transferred to histidine-deficient 
medium supplemented with a range of concentrations of aminotriazole (an inhibitor of the 
//AS' J gene product that serves to reduce background growth). The results, shown in Figure 7. 
demonstrate that the synthetic oligo-nucleotide. expressed in the context of MFal by Cadus 
1215. conferred upon CY588 an ability to grow on histidine-deficient media supplemented 
with aminotriazole. In summation, these data indicate that: 1. CV588 is competent for the 
secretion of a peptide encoded by the (\\N) n sequence of the synthetic oligonucleotide 
cloned into and expressed from Cadus 1215; and 2. CY588 can. in an autocrine fashion, 
respond to a secreted peptide which stimulates its pheromone response pathway, in this case 
by a-factor binding to STE2. 

Additional experiments were performed to test the utility of autocrine yeast strains in 
identifying agonists of the Ste2 receptor from among members of two semi-random a-factor 
libraries. a-Mid-5 and MFa-8. 
a-\iid-5 Library 

A library of semi-random peptides, termed the a-Mid-5 library , was constructed. In 
this library, the \-terminal four amino acids and the C-terminal four amino acids of a 13 
residue peptide are identical to those of native a-factor while the central five residues 
(residues 5-9) are encoded by the degenerate sequence (NNQh. The following 
oligonucleotides were used in the construction of the a-Mid-5 library: 
( I )MFaMbo. a 17 mer: 

5' GCGTCACAGACTGATCA (SEQ ID \O:40) 

( 2 )MID5 ALF. a 71 mer: 

5' 

G C C G TC A G T A A A G CTT GG C A TTG G TTG NN QN N QN NQNNQMM OCA G C C T A TG T A C 
TGATC 

AGTCTGTGACGC (SEQ ID NO:41 ) 

Sequenase (United States Biochemical Corporation, Cleveland. Ohio) was used to complete 
the duplex formed after annealing MFaMbo to the MID5ALF oligonucleotide. In the 
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MID5ALF sequence. N indicates a mixture of A. C, G, and T at ratios of 0.8:1:1.3:1; Q 
indicates a mixture of C and G at a ratio of 1 : 1 .3. These ratios were employed to compensate 
for the different coupling efficiences of the bases during oligonucleotide synthesis and were 
thus intended to normalize the appearance of all bases in the library. In the oligonucleotide 
sequences above, the Hindlll site is underlined and the Mbol site is emboldened. The double- 
stranded oligonucleotide was restricted with Hindlll and Mbol and ligated to Cadus 1625 (see 
above); Cadus 1625 had been prepared to accept the semi-random oligonucleotides by 
restriction with Hindlll and Bglll. 

The apparant complexity of the aMid-5 library is 1 x 10 . This complexity is based 
on the number of bacterial transformants obtained with the library DNA versus transformants 
obtained with control vector DNA that lacks insert. Sequence analysis of six clones from the 
library demonstrated that each contained a unique insert. 

To identify peptide members of the a-Mid-5 library that could act as agonists on the STE2 
receptor, CY1455, a high transforming version of CY588, was electroporated to enhance 
uptake of a-Mid-5 DNA. Transformants were selected on uracil-deficient (-Ura) synthetic 
complete medium and were transferred to histidine-deflcient (-His) synthetic complete 
medium supplemented with 0.5mM or ImM aminotriazole. 

Yeast able to grow on -His + aminotriazole medium include (1) yeast which are 
dependent on the expression of an a -factor variant agonist and (2) yeast which contain 
mutations that result in constitutive signalling along the pheromone pathway. Yeast 
expressing and secreting a variant STE2 receptor agonist have the ability to stimulate the 
growth on -His medium of surrounding CY 1455 cells that do not express such an agonist. 
Thus a recognizable formation (termed a "starburst") results, consisting of a central colony, 
growing by virtue of autocrine stimuation of the pheromone pathway , surrounded by satellite 
colonies, growing by virtue of paracrine stimulation of the pheromone pathway by the agonist 
peptide as that ptpetide diffuses radially fromn the central, secreting colony. 

In order to identify the peptide sequence responsible for this "starburst" phenomenon, 
yeast were transferred from the center of the "starburst" and streaks were made on -Ura 
medium to obtain single colonies. Individual clones from -Ura were tested for the His* 
phenotype on -His + aminotriazole plates containing a sparse lawn of CY1455 cells. 
Autocrine yeast expressing a peptide agonist exhibited the "starburst" phenotype as the 
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secreted agonist stimulated the growth of surrounding cells that lacked the peptide but were 
capable of responding to it. Constitutive pheromone pathway mutants were capable of 
growth on -His - aminotriazole but were incapable of enabling the grwoth of surrounding 
lawn cells. 

Alternatively, streaks of candidate autocrine yeast clones were made on plates 
containing 5-fluoroorotic acid (FOA) to obtain L'ra segregants were retested on -His - 
aminotriazole for the loss of the His* phenotype. Clones that lost the ability to grow on -His 
^ aminotriazole after selection on FOA (and loss of the peptide-encoding plasmid) derived 
from candidate expressors of a peptide agonist. The plasmid was rescued from candidate 
clones and the peptide sequences determined. In addition, a plasmid encoding a putative Ste2 
agonist was reintroduced into CY1455 to confirm that the presence of the plasmid encoding 
the peptide agonist conferred the His' phenotype to CY 1 455. 

By following the above protocol novel Ste2 agonists have been identified from the a-Mid-5 
library. Sequences of nine agonists follow, preceded by the sequence fo the native a-factor 
pheromone and by the oligonucleotide used to encode the native pheromone in these 
experiments. (Note the variant codons used in the a-factor-encoding oligonucleotide for 
glutamine and proline in the C-terminal amino acids of a-factor). Below each nucleotide 
sequence is the encoded amino acid sequence with variations from the native-pheromone 
underlined. 



a-factor TGG CAT TGG TTG CAG CTA AAA CCT GGC CAA CCA ATG TAC 
encodes Trp His Trp Leu Gin Leu Lys Pro Gly Gin Pro Met Tyr 

(DNA, SEQ ID NO:42, A AS, SEQ ID NO:43) 



a-factor 

oligo: TGG CAT TGG TTG CAG CTA AAA CCT GGC CAG CCT ATG TAC (SEQ ID 
NO:9l ) 
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encodes Trp His Trp Leu Gin Leu Lys Pro Gly Gin Pro Met Tvr (SEQ ID No:92) 

Ml TGG CAT TGG TTG TCC TTG TCG CCC GGG C AG CCT ATG TAC 

(SEQ ID NO:Q3) 

encodes Trp His Trp Leu Ser Leu Ser Pro Gly Gin Pro Met Tvr (SEQ ID NO:Q4) 

M2 TGG CAT TGG TTG TCC CTG G AC GCT GGC CAG CCT ATG TAC 

(SEQ IDNO:95) 

encodes Trp His Trp Leu Ser Leu As£ Ala Gly Gin Pro Met Tvr (SEQ ID NQ:%) 

M3 TGG CAT TGG TTG ACC TTG ATG GCC GGG CAG CCT ATG TAC 

( SEQ ID NO:Q7) 

encodes Trp His Trp Leu Thr Leu Met Ala Gly Gin Pro Met Tvr (SEP ID NO:98> 

M4 TGG CAT TGG TTG CAG CTG TCG GCG GGC CAG CCT ATG TAC 

(SEQ ID NO:9 9> 

encodes Trp His Trp Leu Gin Leu Ser Aja Gly Gin Pro Met Tvr (SEQ ID NO: 100) 

M5 TGG CAT TGG TTG AGG TTG CAG TCC GGC CAG CCT ATG TAC 

(S EQ ID NO:101 ) 

encodes Trp His Trp Leu Arg Leu Gin Ser Gly Gin Pro Met Tvr (SEQ ID NO: 102) 

Mb TGG CAT TGG TTG CGC TTG TCC GCC GGG CAG CCT ATG TAC 

(SEQ ID NO:103) 

encodes Trp His Trp Leu Arg Leu Ser Ala Gly Gin Pro Met Tvr (SEQ ID NO: 104) 

M7 TGG CAT TGG TTG TCG CTC GTC CCG GGG CAG CCT ATG TAC 

(SEQ ID NO:105) 

encodes Trp His Trp Leu Ser Leu Val Pro Gly Gin Pro Met Tvr (SEQ ID NO:I06) 
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M8 TGG CAT TGG TTG TCC CTG TAC CCC GGG CAG CCT ATG TAG 

( SEQ ID NO:107) 

encodes Trp His Trp Leu Ser Leu Tvr Pro Gly Gin Pro Met Tvr lSEO ID NO: 108) 

M9 TGG CAT TGG TTG CGG CTG CAG CCC GGG CAG CCT ATG TAC 

(SEQ ID NO:10Q) 

encodes Trp His Trp Leu Arg Leu Gin Pro Gly Gin Pro Met Tvr (SEQ ID NO: 1 10) 
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The nine peptide agonists of the Ste2 receptor above were derived from one 
electroporation of CY 1455 using 1 ug of the a-Mid-5 library DNA. Approximately 3 x 10' 
transformants were obtained, representing approximately .03° o of the sequences present in 
that library. 

MFa-8 Library 

A semi-random a-factor library was obtained through synthesis of mutagenized ce- 
faclor oligonucleotides such that 1 in 10.000 peptide products were expected to be genuine ct- 
factor. The mutagenesis was accomplished with doped synthesis of the oligonucleotides: 
each nucleotide was made approximately 68°o accurate by synthesizing the following two 
oligos: 

5' C T GGATG C G \\ GACTCAGCT (20 mer)(oligo060) (SEQ ID NO:44) 
where the Fokl site is underlined and the Bbsl site is emboldened. 

5' CGGATCiA J'CA gta cat tgg ttg gec agg ttt tag ctg caa cca atg cca AGC TGA GTC 
TTC GC A TCC AG (69 mer)(oligo074) (SEQ ID NO:45) 

where the 8cll site is italicized, the Fokl site is underlined, the Bbsl site is emboldened. The 
lower case letters indicate a mixture of 67% of that nucleotide and 1 1% of each of the other 
three nucleotides (e.g. t indicates 67% T and 11% A. 11% C. and 11% G). Note that 
digestion of the double-stranded oligo-nucleotide by Fokl or Bbsl will yield an identical 5' 
end that is compatible with Hindi 1 1 ends. 

Oligos 060 and 074 will form the following double-stranded molecule when annealed: 
5' CT GGATG CGAAGACTCAGCX {SEQ ID NQ:44) 

3* GA CCTAC GCTTCTGAGTCGA acc gta acc aac gtc gat ttt gga ccg gtt ggt tac atg 
ACTAGTAGGC 5' (SEP ID NO:125) 



Deleted: 



The duplex was repetitively filled-in using Taq DNA polymerase (Promega 
Corporation, Madison. W isconsin). The double-stranded product was restricted with Bbsl 
and Bell and ligated into Hindlll- and Bglll-digested Cadus 1373. The Belli Bell joint 
creates a TGA stop codon for the termination of translation of the randomers. L'sing this 
approach, the MFa-5.8 library (a library of apparent low complexity based on PCR analysis 
of oligonucleotide insert frequency) was constructed. 

To identify peptide members of the MFa-5.8 library that could act as agonists on the STE2 
receptor. CY1455. a high transforming version of CY588. was electroporated to enhance 
uptake of MFa-5.8 DNA. Transformants were selected on uracil-deficient (-Ural synthetic 
complete medium and were transferred to histidine-deficient (-His) synthetic complete 
medium supplemented with 1.0 mM or 2.5 mM aminotriazole. Yeast from colonies which 
were surrounded by satellite growth were transferred as streaks to 

-Lira medium to obtain single colonies. Yeast from single colonies wree then tested for the 
His* phenotype on -His - aminotriazole plates. Sequence analysis of seven of the plasmids 
rescued from His' yeast revealed three unique a-factor variants that acted as agonists on the 
S TE2 receptor. 

1.4 independent clones had the following sequence: 

TGG CAT TGG CTA CAG C TA ACG CCT GGG CA A CCA ATG TAC 
(SEQ ID vO:46) 

encoding Trp His Trp Leu Gin Leu Thr Pro Glv Gin Pro Met Tyr 

(SEQ ID NO:47) 
2.2 independent clones had the following sequence: 

TCiG CAT TGG CTG GAG CTT ATG CCT GGC CAA CCA TTA TAC 

(SEQ ID NO:48) 

encoding Trp His Trp Leu Glu Leu Met Pro Gly Gin Pro Leu Tyr 
(SEQ ID NO:4<>) 

3. TGG CAT TGG ATG GAG CTA AGA CCT GGC CAA CCA ATG TAC 

(SEQ ID NO:50) 

encoding Trp His Trp Met Glu Leu Arg Pro Gly Gin Pro Met Tyr 
<SEQIDNO:51) 



Autocrine Mata strain CY599: 

A MATa strain designed for the expression of peptides in the context of MFAl (i.e.. 
using the MFAl expression vector. Cadus 1239) has been constructed. The genotype of this 
strain, designated CY599. is MATa mfal mfa2 furl- 1 his3 fusl-tIIS3 ste2-STE3 ura3 
met I tide I leu 2. In this strain, Cadus 1 147 (see above) was used to replace STE2 with a 
hybrid gene in which the STE3 coding region is under the control of expression elements 
which normally drive the expression of STE2. As a result, the a-factor receptor replaces the 
cx-factor receptor. The genes which encode a-factor are deleted from this strain; the far I 
mutation abrogates the arrest of growth which normally follow s stimulation of the pheromone 
response pathway; and the FUS1-HIS3 hybrid gene (integrated at the HIS3 locus) provides a 
selectable signal of activation of the pheromone response pathway. CY599 cells were 
expected to be capable of the transport of a-factor or a-factor-like peptides encoded by 
oligonucleotides expressed from Cadus 1239 by virtue of expression of the endogenous yeast 
transporter, Ste6. 

Transport of peptides by the yeast a-J"aetor pathway: 

Experiments were performed to test: 1. the ability of Cadus 1239 to function as a 
v ector for the expression of peptides encoded by synthetic oligonucleotides; 2. the suitability 
of the oligonucleotides, as designed, to direct the export of farnesylated, carboxy methylated 
peptides through the pathway normally used by a-factor; 3. the capacity of CY599 to export 
these peptides; and 4. the ability of CY599 to respond to those peptides that stimulate the 
pheromone response pathway by growing on selective media. These tests were performed 
using an oligonucleotide which encodes the 12 amino acid a-factor; specifically, the 
degenerate sequence (NNN) n in the oligo-nucleotide cloned into Cadus 1239 (see above) 
(with n=12) encodes the peptide component of a-factor pheromone. CY599 was trans- 
formed with the resulting plasmid (Cadus 1220), and transformants selected on uracil- 
deflcient medium were transferred to histidine-deficient medium supplemented with a range 
of concentrations of aminotriazole. The results, shown in Figure 8, demonstrate that the 



92 



synthetic oligonucleotide, expressed in the context of MFA1 by Cadus 1220. conferred upon 
CY599 enhanced aminotriazole-resistant growth on histidine-deficient media. In summation, 
these data indicate: !. Cadus 1220 and the designed oligonucleotide are competent to direct 
the expression and export of a farnesylated. carboxymethylated peptide encoded by the 
(NNN) n sequence of the synthetic oligonucleotide; and 2. CY599 can, in an autocrine 
fashion, respond to a farnesylated, carboxy-methylated peptide that stimulates its pheromone 
response pathway, in this case signaling initiates as a-factor binds to STE3. 

Example 5. Proof of Concept 

This example w ill demonstrate the utility of the autocrine system for the discovery of 
peptides which behave as functional pheromone analogues. By analogy, this system can be 
used to discover peptides that productively interact with any pheromone receptor surrogates. 

CY588 (see strain 5. Example 2 above) will be transformed with CADUS 1215 
containing oligonucleotides encoding random tridecapeptides for the isolation of functional a 
-factor analogues (Figure 4) CY599 (see strain 6, Example 2 above) will be transformed 
with CADUS 1239 containing oligos of random sequence for the isolation of functional a- 
factor analogues (Figure 6). Colonies of either strain which can grow on histidine-deficient 
media following transformation will be expanded for the preparation of plasmid DNA. and 
the oligo-nucleotide cloned into the expression plasmid will be sequenced to determine the 
amino acid sequence of the peptide which presumably activates the pheromone receptor. This 
plasmid will then be transfected into an isogenic strain to confirm its ability to encode a 
peptide which activates the pheromone receptor. Successful completion of these experiments 
will demonstrate the potential of the system for the discovery of peptides which can activate 
membrane receptors coupled to the pheromone response pathway. 

Random oligonucleotides to be expressed by the expression plasmid CADUS 1215 
will encode tridecapeptides constructed as 

5' CGTGAAG CTTAAG CGTGAGGCAGAAGCT(NNK)„ TGATCA TCCG. (SEQ ID NO:6) 
where N is any nucleotide, K is either T or G at a ratio of 40:60 (see Proc Natl Acad Sci 
87:6378. 1990: ibid 89:5393. 1992). and the Aflll and Bell sites are underlined. This 
oligonucleotide is designed such that: the Aflll and Bell sites permit inserting the oligos into 



the A fill and Bglll site of CADL'S 1215 (see Figure 4); the Hindlll site just 5' to the Aflll site 
in the 5' end of the oligo allows future flexibility with cloning of the oligos: the virtual repeat 
of GAGGCT and the GAGA repeats which are present in the wild-type sequence and which 
can form triple helixes are changed without altering the encoded amino acids. The random 
oligonucleotides described above will actually be constructed from the following two oligos: 
5' CGTGAAGCTTAAGCGTGAGGCAGAAGCT (SEQ ID NO:26> and 
5' CGGATGATCA(\l\N) i; AGCTTCTG (SEQ ID NO:27). 

where M is either A or C at a ratio of 40:60. The oligos will be annealed w ith one another 
and repetitively filled in. denatured, and reannealed (Ray et al. Gene. 1993). The double- 
stranded product will be cut with Aflll and Bell and ligated into the Aflll-and BgUI-digested 
CADL'S 1215. The Bglll Bell joint will create a TGA stop codon for termination of 
translation of the randomers (Figure 4). Because of the TA content of the Afl overhang, the 
oligos will be ligated to the Aflll-and BgUI-digested pADC-MFa at 4°C. 

Random oligonucleotides to be expressed by the expression plasmid CADLS 1239 
will encode monodecapeptides constructed as 

5' GGTA CTCGAG TGA AAAGAAGGACAAC(NNK ),,TGTGTTATTG CTTAAG TACG 
(SEQ ID NO:12), 

where N is any nucleotide. K is either T or G at a ratio of 40:60 (see Proc Natl Acad set 
87:6378. 1990; ibid 89:5393. 1992). and the Xhol and Aflll sites are underlined. When 
cloned into the Xhol and Aflll sites of CADLS 1239 the propeptides expressed under the 
control of the ADH1 promoter will contain the entire leader peptide of MFal. followed by 1 1 
random amino acids, followed by triplets encoding CVTA (the C-terrninal tetrapeptide of 
wild-type a-factor). Processing of the propeptide should result in the secretion of 
dodecapeptides which contain 1 1 random amino acids followed by a C-terminal, farnesv lated, 
carboxymethylated cysteine. 

L'sing the procedure described above, the oligonucleotides for expression in CADLS 
1239 will actually be constructed from the following two oligos: 
5' GGTACTCGAGTGAA A AGAAGGACAAC ( SEQ ID NO:28) and 
5' CGTACTTAAGCA ATA ACAca(MNN) n GTTGTCC (SEQ ID NO:29). 
where M is either A or C at a ratio of 40:60. and the Xhol and Aflll sites are underlined. 



Discovery of a-factor analogues from a random peptide library 

An optimized version of strain 6 (Example 2 above) was derived. This yeast strain. 
CY2012 l AM 7 a ste2-SIE3 far 1 AI 442 mfaE LEl 2 mfa2-lacZ fusI~HIS3 ibll-l uraS Ieu2 
his3 trpl suc2)^ was constructed as follows. From a cross of CY570 (A/. J 7a mfal LEA. 2 
mfa2-iacZ ura3 trpl his3A200 can! Ieu2 fusl-HIS3 [ME A I LRA3 2^i][El'SI A8-'3 ERR I 
CEX6]) by CY1624 (\fATa thtl-l fusl-HIS3 trpl ura3 leu2 his3 lys2-80l SEC-), a spore 
was selected (CY1877) of the following genotype: A/.J/a mfaE LEI 2 mfa2-lacZ fits I -HIS3 
tbtl-1 ura3 leu2 his3 trpl suc2. This strain lacks both genes {MEAI and MFA2) encoding a- 
factor precursors, contains the appropriate pheromone pathway reporter gene {fits I -HIS3). and 
transforms by electroporation at high efficiency (tbtl-I). This strain was altered by deletion 
of the EAR! gene (with Cadus 1442; see Example 6), and replacement of STL12 coding 
sequences with that of SEE3 (see Example 1) to yield CY2012. 

This strain was transformed with plasmid DNA from a random a-factor library by 
electroporation and plated on 17 synthetic complete plates lacking uracil (-Ura), yielding 
approximately 10" Ura* colonies per plate after 2 days at 30°C. These colonies were replica 
plated to histidine-deflcient synthetic complete media (-His) containing 0.2 mM 3- 
aminotriazole and after three days at 30°C 35 His' replicas were streaked to -Ura plates. The 
resultant colonies, 3 from each isolate, were retested for their His' phenotype, and streaked to 
5-fluoroorotic acid plates to obtain Ura" segregants (lacking a library plasmid). Those Ura" 
segregants were tested for the loss of their His" phenotype. Ten of the original isolates passed 
these tests; in two cases only one of the three Ura' colonies purified from the isolate retained 
the His' phenotype, but nevertheless subsequently segregated Ura" His" colonies. 

A single plasmid (corresponding to a bacterial colony ) was obtained from each of the 
ten isolates, and reintroduced into CY2012. Eight of the ten plasmids passed the test of 
retaining the ability to confer the His' phenotype on CY2012 (the two that failed correspond 
to the two isolates that were mentioned above, suggesting that these isolates contain at least 
one Irrelevant'' plasmid). Sequencing of the randomized insert in the eight plasmids of 
interest revealed that four contain the sequence: 

TAT GCT CTG TTT GTT CAT TTT TTT GAT ATT CCG (SEQ ID NO:52) 



T\r Ala Leu Phe \'al His Phe Phe Asp He Pro. (SEQ ID NO:53) 
two contain the sequence: 

TTT A AG GGT CAG GIG CGT TTT GTG GTT CTT OCT {SEQ ID NO:54) 
Phe L\s Gly Gin Yal Arg Phe Yal Val Leu Ala. (SEQ ID NO:55 > 
and two contain the sequence: 

CTT ATG TCT CCG TCT TTT TTT TTT TTG CCT GCG (SEQ ID NO:56) 
Leu Met Ser Pro Ser Phe Phe Phe Leu Pro Ala (SEQ ID NO:57) 

Clearly, these sequences encode novel peptides, as the native a-factor sequence differs 
considerably: 

Tyr He He Lys Gly Val Phe Trp Asp Pro Ala. 

The a-factor variants identified from random peptide libraries have utility as 
"improved" substrates of ABC transporters expressed in yeast. For example, identification of 
a preferred substrate of human MDR. one that retains agonist activity on the pheromone 
receptor, would permit the establishment of robust yeast screens to be used in the discovery of 
compounds that affect transporter function. 



Example 6 

Drug screens designed to permit discovery of molecules which modulate the 
function of ATP-dependent transmembrane transporters: 

The availability of cloned DNA encoding the related proteins human Mdrl. human 
CFTR and human MRP, will allow the construction of yeast strains expressing these 
molecules. The resultant strains will be essential to the design of microbiological assays 
which can be used to probe the function of these proteins and to discover molecules capable 
of inhibiting or enhancing their function in cellular resistance to chemotherapeutics or in ion 
transport. The present assay makes use of the transport of the yeast mating pheromone a- 
factor from autocrine y east expressing a human protein capable of substituting for yeast Ste6. 
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A. MFal- and Mdrl-containing plasmids obtained from Karl Kuehler ( University of Vienna) 
for use in these experiments: 

( 1 } pYMA 1 77 ( denoted Cadus 1067), see Figure 9. 

i2) pKK\ includes sequence encoding MFal in YFp351: a-factor is overexpressed from this 
plasmid due to increased plasmid copy number 

(3) pHaMDRl(wt) provides wild type Mdrl cDNA in a retroviral vector (initially obtained 
from Michael Gottesman. NIH). 

B. Plasmids constructed at Cadus for use in these experiments: 

A 1.5 kb BamHI-Bgll! fragment which includes MFal sequence was derived from 
pYMA177 and ligated to BamH I-digested pYMA177 to yield Cadus 1079. Cadus 1079 was 
digested with Bglll and recircularized to delete a 950 bp fragment containing sequence 
encoding the G185V mutant of human Mdrl; the resultant plasmid is Cadus 1093. 
pHaMDRl was digested with Bglll to allow isolation of a 965 bp fragment containing wild 
type human Mdrl (G185) sequence. The 965 bp Bglll fragment was inserted into Ball I- 
digested Cadus 1093 to yield Cadus 1097. The Cadus 1097 construct was verified by 
sequencing using dideoxy nucleotides. 

To yield Cadus 1 164, pYMA177 was digested with BamHI and recircularized to eliminate the 
sequence encoding MFal. Cadus 1 165 was constructed by ligating a 700 bp Bglll-BamHI 
fragment from pYMA 1 77 to the large BamHI to Bglll fragment of pYMA 1 77. This results in 
the removal of both the 1.6 kb MFa BamHI fragment and the 965 bp Bglll fragment encoding 
human Mdrl (G 1 85 V ); the resulting plasmid is Cadus 1 1 65. 

Cadus 1176 resulted from the ligation of a 965 bp Bglll fragment from pHaMDRl. 
containing sequence encoding wild type human Mdrl, to Bglli-digested Cadus 1 165. 
pRS426 (Cadus 1019) served as a URA3 control plasmid. 

The final plasmid array used in these experiments is as follows: 
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no Mdrl 



mutant Mdrl (G 185V » 



\vt Mdrl (G185) 



a-factor overexpression 



1065 



1079 



1097 



no a-factor overexpression 1019 



1 164 



1 176 



Evidence for the expression of human Mdrl constructs in yeast. 

CADLS plasmids 1065, 1079 and 1097 as well as a L'RA3 control plasmid (pRS426 
= 1019) were transformed into a ura3\ ste6' strain of yeast (WK.K.6 = CY20 = MATa ura3-l 
Ieu2-3,I12 his3-IL15 trpl-l ade2-l canl-100 ste6::IfIS3. obtained from Karl kuchler). 
Individual transformants were grown overnight in SD-LRA media and lawns containing 
approximately 5x1 0 h cells were poured onto YPD plates in 3 ml of YPD top agar. Sterile 
filter disks were placed on the plates after the top agar had solidified, and 5 ml of DMSO or 
5m\1 valinomycin in DMSO were spotted onto the filter disks. By this assay, expression of 
mutant Mdrl from plasmid 1079 in WKK6 cells conferred weak resistance to valinomycin 
while expression of wild type Mdrl from plasmid 1097 conferred complete resistance. 

Attempts to assay Mdrl activity by a-factor transport in a two cell system. 

Several attempts to demonstrate mating of WKK.6 transformed with the various 
human Mdrl constructs failed. This is in contrast to published experiments utilizing the 
mouse mdr3 gene, in which a partial complementation of mating deficiency was seen 
(Raymond et al.. 1992). 

A "halo" assay was performed by patching WK.K5 cells containing the various Mdrl 
constructs (1019, 1065, 1079 & 1097) onto a lawn of Matot cells which are supersensitive to 
a-factor (CY32 - MATa ieu2-3Jl2 irpl-289 ura3-52 his3*I sst2*2 GAL*). Although all 
halos were much smaller than those produced by STE6' cells, a small difference in halo size 
was observed with the relative order 1097 > 1079 - 1065 > 1019. This indicates that a-factor 
can be transported by the wild-type human Mdrl protein 
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expressed in yeast deleted for SI E6, however, the halo assa\ does not appear to be amenable 
to rapid drug screening. 

The halo assay detects a-factor secreted from sie6' Mdrl strains by growth arrest of 
cells present in the indicator lawn. An alternative, and potentially more sensitive, method 
makes use of the transcriptional response to pheromone signaling. A strain w ith an a-factor 
responsive tUS3 gene (CY104 = MATct uraS leu2 trpl his 3 f its I -//IS 3 can/ stel-4 TRPI) 
was cross-streaked with STE6' (CY19 = \V303-la - MATa ura3-l Iett2-3.I12 his3-IL15 
trpl-1 ade2-i ami -100) or ste6' (WKK6 = CY20 - MATa ura3-l leu2-3,112 His3-Il.l5 
trpl-l ade2-l can 1 - 1 00 ste6 111S3) strains containing various plasmids. The cross-streaking 
was performed on a plate lacking histidine and tryptophan so that only the CY104 indicator 
strain would grow if stimulated by a-factor. The order of a-factor secretion seen by this 
method was CY58>CY6I*CY62* 

CY63>CY60 where: CY58 = CY\9(STE6\ 1019). CY60 = CY20{ste6\ 1019). CY61 = 
CY20(ste6\ 1065). CY62 = CY20(ste6\ 1079) and CY63 - CY20{ste6. 1097). Thus, the 
difference in a-factor production between STE6' and ste6 and between a-factor 
overproduction or not is detectable in this system, whereas the activity of Mdrl in secreting a- 
factor is not. It was not clear from these experiments whether the signal generated from the 
ste6\ a-factor overproducing strains was due to an alternate pathway for a-factor secretion or 
the release of a-factor from U sed cells. 

Assay of Mdrl activity by a-factor transport in an autocrine system. 

A single strain system for the detection of Mdrl -mediated a-factor transport was 
constructed in order to improve sensitivity and reproducibility and to circumvent the potential 
false signal of a-factor release from lysed cells (a cell concentration-dependent phenomenon). 
A strain was constructed (CY293 = MATa ura3 leu 2 trpl his 3 fits I -HIS 3 can! ade2-l ste2- 
STE3 ste6: ERF I ) which could respond to a-factor by growing on media lacking histidine and 
in which the only impediment to the secretion of a-factor is the lack of a functional STE6 
gene. Indeed, the immediate precursor to this strain, which did contain a functional STE6 
gene, was able to grow vigorously on -HIS media containing 3 m\l aminotriazole. whereas 
CY293 did not grow at all. The addition of aminotriazole. a competitive inhibitor of the 
HIS3 enzyme, is necessary to reduce the background growth of these strains. 



The Mdrl -containing plasmids were introduced into CY293. and the transformants 
were streaked onto -HIS plates containing either 0.1 or 0.33 m\l aminotriazole. The growth 
panern thus generated was 1097 > 1067 > 1 065* 1 1 76> 1 1 64^ 10 1 9. The STF6* parent of 
CY293 exhibited a much more v igorous growth than any of these transformants. However, 
human Mdrl -mediated a-factor secretion is clearly detectable in this autocrine system. 
CY293 transformed with 1097 can be used to screen for drugs which enhance the activity of 
the Mdrl protein (increased growth on 

-HIS ^aminotriazole) as well as drugs which inhibit Mdrl activity (decreased growth on -HIS 
^aminotriazole). In the latter case, controls must be designed to identify compounds which 
inhibit yeast growth in an Mdrl -independent fashion. 

Additional strains, with improvements over CY293 (see example 6 [76-35]) for the 
expression of mammalian ABC transporters were constructed. These strains contain the tbtl- 
I allele, conferring high-efficiency transformation by electroporation, and a lesion in the 
h'ARI gene. In addition, they are auxotrophic for tryptophan, and hence can serve as hosts for 
TRP I -based plasmids. 

Two starting strains were selected, CY1555 (A/. I /a tbtl-I fusl-H!S3 trpl ura3 leu2 
his3 iys2-H0I S( and CY1557 (AM7a tbtl-I fusl-IIIS3 trpl ura3 leu2 his3 suc2). A 

plasmid containing an internal deletion of h'ARI was constructed by amplifying genomic 
sequences corresponding to the 5" end and the 3" end of FAR I, and ligating them together in 
pRS406 (an integrative vector containing the f RA3 gene), thereby creating a deletion from 
the 50th to the 696th predicted codon of h'ARI. The oligonucleotides used for amplification 
were: 

for the 5' segment of FAR I: 

5'-CGGGATCCGATGCAATTTTCAACATGC-3* (23FAR1 ) (SEQ ID NO:58) 

and 5 , -GCTCTAGATGCTACTGATCCCGC-3" ( 1RAF616) (SEQ ID NO:59) 

and for the 3' segment of FAR I: 

5"-CGCCGCATGACTCCATTG-3" (2552FAR1 ) (SEQ ID \O:60) 

and S'-GGGGTArCA ATAGGTTCTTTCTTAGG^V ( 1 RAF2979). ( SEQ ID NO:61 ) 

The resultant amplification products were restricted with BamHI and Xbal (5' 
segment: 0.6 kb) or Nhel and Kpnl (3' segment; 0.4 kb). and ligated into pRS406 (Cadus 
101 1) that had been restricted with BamHI and KpnI. The resultant plasmid (Cadus 1442) 



was restricted with EcoRl to direct integration at the FAR1 locus. Lra* transformants were 
purified and subjected to selection on 5-fluoro-orotic acid, and L ra- clones were screened for 
the impaired mating ability conferred by the furl deletion. The SEE6 gene was subsequently 
deleted using the ste6 hisG-l RA3 plasmid (Cadus 1 170; constructed by Karl kuchler). and 
the STE2 coding sequences were replaced with SEE3 coding sequences as described in 
Example 1. The resultant strains were named CY1880 (AM 7a ste2-SEE3 ste6 hisG far I A 
1442 ihil-i fusl-HlS3 trpl ura3 Ieu2 his3 sitc2 ) and CY1882 (A/,17a .sfe2-SEE3 stc6 hisG 
far I Al 442 tbt 1 - 1 fus 1 -t I IS3 trpl ura3 Ieu2 his3 I\s2-$0! SI '('*). 

Like strain CY293 these strains show a dramatic enhancement of growth (via the fusl- 
HIS3 reporter gene) under conditions of histidine starvation when transformed with the 
MDRl-encoding plasmids Cadus 1097 and Cadus 1176 (as compared to control plasmids 
lacking MDR1. Cadus 1065 and 1019). In addition, these strains display more robust general 
growth, transform by electroporation at high efficiency (the tbt I- 1 effect), lack susceptibility 
to pheromone-induced growth arrest (due to inactivation of FAR/), and act as hosts for TRPI- 
based plasmids {ste6::hisG instead of ste6 : : I RP 7 ). 

These strains also act as suitable hosts for the STE6-CF I R chimeras constructed by 
Teem et al. (1993) (see Example 7). When compared with CY293 in their ability to 
distinguish between wild type and AF508 STE6-CFTR in their ability to transport a-factor, a 
similar enhancement is seen: 



Host strain wild type AF508 growth ratio 

CY293 18 
CY1880 7 
CY1882 8 



The strains were transformed with Cadus 1515 {STE6-CFTR(II5) I RA 3 CEX) or 1516 
{STE6-CFTR(/i5)S¥50S ( RA 3 CEX) and inoculated at various cell densities (OD wl(1 =0.003 to 
0.048) into histidine-free media containing various concentrations of 3-aminotriazole (0 to 
1.2 m\l). After overnight growth in microtiter wells the optical densities at 600 nm of the 
wells were measured, and ratios for wild type vs. AF508 calculated. The ratios reflect the 



highest ratios obtained in this experiment but not necessarik the highest ratio it would be 
possible to obtain. 

Improvements on the autocrine yeast expressing Mdrl. 

The results described above indicate that the human Mdrl protein transports a-factor 
less efficiently than does the yeast STE6 protein. Attempts will be made to isolate mutant a- 
factor molecules which are transported more efficiently by human Mdrl and yet which retain 
agonist activity on the a-factor receptor (STE3 protein). To do this, a-factor coding 
sequences will be chemically synthesized using "dirty" nucleotides and inserted into an a- 
factor expression cassette. An example of "dirty" synthesis would be to incorporate 
nucleotide from a mixture of 70% G and 10% each of A. T and C at a position in the a-factor 
sequence where G would normally appear. Using oligonucleotides generated in this manner, 
a diverse library of peptides can be expressed in yeast and screened to identify those peptides 
which retain the ability to signal to the STE3 protein but which are also a favorable substrate 
for transport by human Mdr 1 . 

A second improvement to the system will be the addition of a pheromone-inducible 
"negative selection" marker. For instance, the FUS1 promoter can be connected to GAL1 
coding sequences. Expression of GAL1 is toxic in the presence of galactose in strains which 
contain mutations in either the GAL7 or GAL 10 genes. In the context of an autocrine Mdrl 
strain, this selection system should render cells galactose-sensitive. Addition of a compound 
which inhibits the ability of the Mdrl protein to secrete a-factor would allow this strain to 
grow on galactose-containing media. This selection system would also eliminate false 
positives due to lethality. Controls must still be designed to identify compounds which 
interfere at other points in the pheromone response pathway. 



The third improvement in the system involves inactivation of yeast genes which 
function equivalently to mammalian MDR genes. A network of genes involved in pleiotropic 
drug resistance (PDR) have been identified in veast. This modification will be useful in anv 
yeast screen designed to assay the interaction of compounds w ith intracellular targets. 
The improved autocrine veast strain, expressing human Mdrl. will be used to screen 
compound libraries for molecules which inhibit the transport function of this protein. In 
addition. Mfa expression cassettes containing oligonucleotides encoding random peptides 
will be expressed in the autocrine Mdrl strain to identify peptides capable of inhibiting the 
transport of a-factor. or an a-factor analogue by Mdrl . 

Example 7. 

Identification of an analogue of yeast a-factor that is transported by wild type human 
CFTR. 

This example describes the use of autocrine yeast strains to identify molecules capable 
of enhancing transport by dysfunctional ATP-dependent transmembrane transporters, e.g. 
mutant human CFTR proteins. The wild type human CFTR protein will not substitute for 
Ste6 function in yeast by transporting native a-factor pheromone (John Teem, unpublished 
observations). In order to maximally exploit autocrine yeast strains for the discovery of 
molecules which enhance mutant CFTR function, an a-factor-like peptide which can serve as 
a substrate for transport by CFTR will be identified. An a-factor analogue must also bind 
functionally to the pheromone receptor, Ste3. in order to initiate pheromone signalling. The 
CFTR transport substrate will be identified by expressing a randomly mutated MFa 
expression cassette in autocrine yeast deleted for Ste6 expression, but expressing the wild 
type human CFTR protein. 

Expression of mutant human CFTR proteins in autocrine yeast. 

Transport of an a-factor-like peptide pheromone by CFTR will serve as the basis for 
the design of screens to be used in the identification of compounds which augment the 
transport function of CFTR proteins containing cystic fibrosis (CF) mutations. Based on 
studies done by Teem et al. ( 1993) using Ste6 CFTR chimeras, mutant CFTR is not expected 
to efficiently transport an a-factor analogue. Teem et al. ( 1993) have made chimeric proteins 



by substituting varving portions of the sequences encoding the first nucleotide binding 
domain of human CFTR for analogous sequence in yeast STK6. The chimeric proteins will 
transport native \east a-factor when expressed in yeast. However, introduction of a CF 
mutation (AF508) reduces the ability of these proteins to transport the yeast pheromone. 
Teem et al. have also identified revertants in yeast, i.e., proteins containing second site 
mutations which restore the ability of chimeras bearing the CF mutation to transport a-factor. 
Introduction of the revertant mutations into defective CFTR protein expressed in mammalian 
cells decreased in part the processing and channel defects of the AF508 protein. 
With the identification of an analogue of yeast a-factor that will serve as a substrate for 
transport by CFTR, mutations can be introduced directly into the human CFTR protein 
expressed in yeast, eliminating the necessity of creating chimeric proteins containing Ste6 
sequence. This will permit the targeting of potential CF therapeutics to the entire human 
CFTR molecule. Mutations of interest include G55 1 D, N 1 303K and 

AI507 in addition to AF508: these mutations are naturally occurring, give rise to cystic 
fibrosis in affected individuals, and appear to affect either the transport and processing of 
CFTR or the regulation of function of that protein (Welsh and Smith 1993). These mutations 
also rank among the most common alterations of CFTR thus far identified in CF patients. 
If a peptide which will serve as a substrate for transport by CFTR cannot be identified, 
chimeric Ste6 CFTR proteins and unaltered a-factor can be utilized in screens based on 
autocrine strains of yeast. These strains offer distinct advantages in screening applications: 
easy adaptability to large-scale automation, simplicity and increased sensitivity when 
compared to traditional yeast mating cell assays of pheromone signaling, and the potential to 
employ the instant technology to identify active peptide structures. 



Identification of compounds that enhance transport of a-factor-like peptide by mutant 
CFTR. 

Autocrine strains expressing mutant human CFTR protein will be capable of growth 
on rich media but, due to inadequate transport of and signaling by an a-factor analogue, w ill 
not grow efficiently on selective ( histidine-deficient ) media. Pheromone signaling in these 
strains initiates expression from a Ft SI promoter sequence controlling transcription of the 
His3 enzyme; expression of His3 is required for growth on media lacking histidine. These 
strains will be used to screen compound libraries to identify molecules which reverse the 
inability of the mutant CFTR to transport a-factor analogue and permit grow th of the yeast on 
histidine-deficient medium. Alternatively, active compounds would be capable of signaling 
to the a-factor receptor. Ste3. directly, or may interact with the pheromone response pathway 
elsewhere. Suitable controls would differentiate among these possibilities. 

Identification of random peptides that enhance transport by mutant CFTR. 

Plasmids containing oligonucleotides which encode random peptides will be 
expressed in autocrine yeast bearing a mutant human CFTR. These peptides, expressed using 
a-factor-based expression cassettes, will be transported to the extracellular environment via 
the yeast secretory pathway. Peptides of interest w ill be identified by virtue of their ability to 
permit the growth of a mutant CFTR-containing strain on histidine-deficient medium. Active 
peptides would permit transport of a-factor analogue by the mutant human CFTR. 
Alternatively, a peptide may interact at other points along the pheromone response pathway. 
Suitable controls would differentiate between these possible outcomes. 

Example 8. 

Prophetic Example of Substitution of Prohormone Convertase PCI for V east KEX2. 



The mammalian prohormone convertases PCI PC3 and PC2 are involved in the 
proteolytic processing of proopiomelanocortin (POMC). with PCI preferentially releasing 
adrenocorticotropic hormone (ACTH) and p-lipotropin and PC2 preferentially releasing (3- 
endorphin. N-terminally extended ACTH containing the joining peptide (JP). and either a- 
melanocyte-stimulating hormone (a-MSH) or des-acetyl a-MSH (Benjannet. S.. et al. ( 1991 ) 
Proc. Natl. Acad. Sci. USA 88:3564-3568; Seideh N.G. et al. (1992) FEBS Lett. 310. 235- 
239). By way of example, a yeast strain is described in which mammalian PCI processes a 
chimeric pre-pro-POMC a-factor peptide, permitting the secretion of mature a-factor and 
stimulation of the screening strain in an autocrine fashion to histidine protrotrophy. 
Autocrine strains will be constructed in which: 1. yeast K.EX2 is disrupted; 2. yeast KEX2 
activity is substituted with that of mammalian PCI; 3. a novel MFa construct containing the 
dibasic cleavage site recognized by PCI (in place of the K.EX2 recognition sequence) will be 
expressed; 4. production of mature a-factor will require PCI activity; and 5. growth of the 
strain will be stimulated by the production of mature a-factor. 

The genotype of the parental strain for this example is MATa barl::hisG far 1-1 fusl- 
HIS3 stel4::TRPl ura3 trpl leu2 his3. Initially, the KEX2 allele of this strain will be 
disrupted using an integrating plasmid (pAv.t2._l) encoding the flanking regions of the K.EX2 
locus with the entire coding regions from the initiator codon to the terminator codon deleted. 
Cleavage of pke.\2A with Bsu36I followed by transfection into the parental strain will result 
in integration of this plasmid into the K.EX2 locus. Transformation can be scored as 
conversion to uracil prototrophy. Subsequent transfer of URA- transformants to plates 
containing 5-fluoroorotic acid results in the growth of colonies with the kexlA allele. 
Integration can be confirmed by Southern blot analysis and by colony PCR using 
oligonucleotide primers flanking the K.EX2 locus. This Aev/J strain will be able to grow on 
histidine-deficient media in the presence of exogenenously added a-factor. but will not be 
able to grow in an autocrine fashion on histidine-deficient media once transfected with Cadus 
1219 (URA3 2mu-ori REP3 AmpR fl-ori a-factor) since processing of the pre-pro-POMC a- 
f actor chimera expressed from Cadus 1219 requires KEX2 activity. 

In the screening strain. PCI activity will substitute for the deleted KEX2 activity in 
the maturation of a-factor peptide. PCI cDNA (accession » M69196) obtained from mouse 
{Korner et al ( 1991 ) Proc. Natl. Acad. Sci L'SA 88:6834-6838) was found to encode a protein 



of 753 amino acids. Sequences encoding PCI will be cloned into a high copy replicating 
plasmid (Cadus 1289). Yeast cells transformed with this plasmid will acquire the ability to 
grow on leucine-deficient media and will express high levels of PCI protein due to the 
presence of the PGK promotor. 

A hybrid gene encoding the prepro-region of human POMC (accession ~ K.02406: 
TakahashL H.. et al ( 1 983 ) Nucleic Acids Research 1 1 :6847-6858) and the coding region of a 
single repeat of mature a-factor will be constructed in the following fashion. The prepro- 
region of human POMC w ill be amplified w ith an Hindlll site at the 5' end and a Bbsl site at 
the 3' end using VENT polymerase and the following primers: 5' GGG AAGCTT 
A TGCCGAGATCGTGCTGCCAGCCGC 3' (SEQ ID NO:30) (Hindlll site is underlined and 
initiation codon is italic bold) and antisense 5' 

GGG GAAGAC TTCTGCCCTGCGCCGC TGCTGCC 3' (SEQ ID NO:31) (Bbsl recognition 
is underlined), leaving the amino acid sequence -SSGAGQKR- (SEQ ID NO:125) at the 3' 
end with a Bbsl site leaving an overhang at the -KR- dibasic cleavage sequence. The coding 
region of a-factor will be amplified from Cadus 1219 with a Bbsl site at the 5' end and a 
Bglll site at the 3' end using the primers 5' GGGGAAGACCCGCAGGAGGCAG AAGCTT 
GGTTGCAG 3' (SEQ ID NO:32) (Bbsl site is underlined) and 5 1 
GGGAGATCTTCAGTACATTGGTTGGCC 3* (SEQ ID NO:33) ( Belli site is underlined, 
termination codon is bold). The PCR fragment encoding the pre-pro segment of POMC is 
restricted with Hindlll and Bbsl and gel purifed, the PCR fragment encodinga-factor is cut 
with Bbsl and Bglll and gel purified, and Cadus 1215 is cut with Bglll and partially with 
Hindlll and the Hindlll-Bglll restricted vector containing the pAlter polylinker sequences is 
gel purified. Three-part ligation of the two PCR products with Hindlll and Bglll digested 
Cadus 1215 will y ield a hybrid POMC a-factor gene in which the first 104 amino acids 
residues are from POMC and the remaining 17 are from a-factor. The structure of this hy brid 
gene around the PCI cleavage site is: 

RNSSSSGSSGAGOKR EAEA Wf iW'LOLKPGOPMY* (SEQ ID NO:34) where residues 
donated by POMC are underlined, the dibasic cleavage site is underlined bold, and the 
sequence of mature a-factor is in italics. The tetrapeptide -EAEA- juxtaposed between the 
dibasic cleavage site and the amino-terminal tryptophan of mature a-factor should be 
removed by the dipeptidyl aminopeptidase activity of stel3p. 



Introduction of the PCl-encoding plasmid and the POMC a-factor plasmid into a 
kex2A strain with the genotype MATa barl::hisG far 1 - 1 fusl-HIS3 stel4::TRPl ura3 trpl 
leu2 his3 should result in autocrine growth that is dependent on PCI -mediated cleavage of the 
dibasic motif at the amino-terminal side of the single copy of a-factor encoded by the POMC 
a-factor plasmid. That is. the autocrine behaviour of this strain should be due to expression 
of PC 1 . A suitable negative control is provided by expression of mammalian PC2 in place of 
PCI; the cleavage site of POMC that is inserted upstream of thea-factor gene is not 
recognized by PC2. Therefore, it should be possible to construct strains specific for PC1- 
versus PC2-dependent processing by judicious choice of cleavage sites from POMC to 
append to the 5' end of the a-factor gene. Compounds or random peptides that disrupt the 
autocrine growth of this strain but which do not have growth inhibitor) effects when this 
strain is grown in histidine-containing media are potential inhibitors of PCI activity. 

Example 9. 

Prophetic Example of Substitution of Human MEK (MAP Kinase Kinase) for Yeast 
STE7. 

In order to develop a screen for compounds which act as inhibitors of a mammalian 
kinase, one could construct a yeast strain that is deficient in STE7 activity, and which 
contains human MEK in a yeast expression vector. In addition, the strain would be equipped 
with reporter capacities, as described below. 

To disrupt the yeast S7'E~ gene, the following approach could be taken: 
pBluescriptKS^. a multicopy E. coli vector, is engineered to contain STE~ sequences deleted 
for 5' noncoding and promoter sequence and for a sizeable portion of the coding region. The 
|ste7 sequence is then subcloned into pRS406|ClaI, a Bluescript-based plasmid containing the 
yeast URA3 gene and the resulting plasmid, pRS406;ClaI!ste7, is used to disrupt the wild type 
STE~ gene as follows. pRS406|ste7 is digested with Clal. and used to transform yeast strain 
CY252 (genotype MAT* ste!4::TRPl fits I-HIS3 far 1-1 ura3 leu2 trpl his3 ade2-l met I) to 
uracil prototrophy. Subsequent transfer of Ura* transformants to media containing 5- 
fluoroorotic acid results in colonies containing the ste~ allele, which is confirmed by 
Southern analysis and by the inability of the strain to grow in the absence of histidine when 
stimulated with a pheromone. 



To construct a plasmid capable of expressing human MEK in >east cells, the 
following oligonucleotides are constructed: 5'- 

CCGCGTCTCAC ATGCCCAAG A AG A AGCCG-3' (SEQ ID NO:35) (forward) and 5- 
CCGTCTAGATGCTGGCAGCGTGGG-3* (SEQ ID NO:36) (reverse). When used in a 
polymerase chain reaction (PGR) with human cDNA as template, these primers will direct the 
amplification of DNA encoding human MEK. To insert the human MEK gene into a yeast 
expression vector, the PGR product is digested with Esp3I and Xbal (bold sequences above) 
and ligated to the yeast-/: co/i shuttle plasmid Cadusl289. previously digested with Ncol and 
Xbal. The resulting plasmid will replicate autonomously in yeast cells, confer leucine 
prototrophy to a yeast Ieu2 mutant, and direct the expression of MEK from the constitutive 
PGK1 promoter. 

When the above-described PGK1-MEK plasmid is introduced into the CY252-ste~ 
cells, it should restore their ability to grow in the absence of histidine when incubated with a 
pheromone, due to the ability of MEK to functionally replace STE7 in the pheromone 
response pathway and thereby effect the stimulation of the fus I-/IIS3 fusion situated in the 
chromosome. One could then screen for compounds which are able to reverse a pheromone- 
dependent growth in the absence of histidine, but which have no nonspecific toxic effect in 
the presence of histidine. 

In an autocrine embodiment, the yeast cells made ste~ are of the genotype l/.f/a 
hurl hisd farl-l fus/-ff/S3 ste 1 4 trpl A KS'2 ura3 trpl leu2 hisS iys2 <CY588trp). The 
procedure followed is exactly as above for CY252. After construction of CY588trpste ~ . the 
cells created are transformed with the plasmid expressing MEK, as well as with a plasmid 
capable of expressing secreted a pheromone. This transformed strain 

(CY588trp.vte ~ [MEK MFa] ) should be able to grow on media lacking histidine and 
containing high (20 22222222222222m M) amounts of 3-aminotriazole. The growth of this 
strain on this media should be strictly dependent on the presence of both plasmids (each 
necessary, neither sufficient). Compounds that interfere with this growth could be tested as 
above, with the exception that exogenously added a pheromone is unnecessary. 

In addition. CY588trp.?f<?" [MEK MFa] could be transformed with a plasmid library 
expressing cytoplasmically targeted random peptides. Those that interfere with the function 
of MEK would be identified by replica plating to media deficient in histidine (and potentially 



containing 3-aminotriazole ); cells expressing such inhibitor) peptides would be His". Such a 
screen could be streamlined with the addition of reporter constructs with the potential of 
negativ e selection, such as fus I -( R.13 or fus I '-HAL I . In this case inhibitor) peptides would 
confer on the target strain the ability to grow on 5-FOA- (for cells with diminished fusl- 
l RA3) or galactose- (for cells with diminished fus /-(/'. 1 L I in aga/IO background) containing 
media. 

Confirmatory tests would include biochemical assay of the activity of MEK (either in 
vitro or in vivo) in the presence of random peptides or other molecules identified as potential 
MEK inhibitors. 

Example 10. 

Functional Expression of a Mammalian G Protein-Coupled Receptor and Ligand in an 
Autocrine Yeast Strain. 

In this example we describe a set of experiments that detail the accomplishment of the 
follow ing: (1 ) expression of human C5a receptor in yeast; (2) expression of the native ligand 
of this receptor, human C5a. in yeast: and (3) activation of the endogenous yeast pheromone 
pathway upon stimulation of the C5a receptor by C5a when both of these molecules are 
expressed w ithin the same strain of autocrine yeast. Follow ing the experimental 



data we outline the utility of autocrine strains of yeast that functionally express the human 
C5a receptor. 

Human C5a is a 74 amino acid polypeptide that derives from the fifth component of 
complement during activation of the complement cascade: it is the most potent of the 
complement-derived anaphv latoxins. C5a is a powerful activator of neutrophil and 
macrophage functions including production of cvlotoxic superoxide radicals and induction of 
chemotaxis and adhesiveness. In addition C5a stimulates smooth muscle contraction, induces 
degranulation of mast cells, induces serotonin release from platelets and increases vascular 
permeability. The C5a anaphviatoxin can also amplify the inflammatory response by 
stimulating the production of cytokines. As C5a is a highly potent inflammatory agent, it is a 
primary target for the development of antagonists to be used for intervention in a variety of 
inflammatory processes. 

The C5a receptor is present on neutrophils, mast cells, macrophages and smooth 
muscle cells and couples through G proteins to transmit signals initiated through the binding 
of C5a. 

Expression of the C5a Receptor 

The plasmid pCDM8-C5aRc. bearing cDNA sequence encoding the human C5a 
receptor, was obtained from N. Gerard and C. Gerard (Harvard Medical School, Boston, MA) 
(Gerard and Gerard 1991). Sequence encoding C5a was derived from this plasmid by PCR 
using VENT polymerase (New England Biolabs Inc.. Beverly MA), and the following 
primers: 

*\ - GGTGGGAGGGTGCTCTCTAGAAGGA AGTGTTCACC (SEQ ID NO:62) 
*2 - GCC CAGGAGACCAGACC ATGG ACTCCTTCAATTATACCACC. (SCO IDNO:63) 
Primer #1 contains a single base-pair mismatch (underlined) to C5a receptor cDNA. It 
introduces an Xbal site (in bold) 201 bp downstream from the TAG termination codon of the 
C5a receptor coding sequence. Primer p-2 contains two mismatched bases and serves to create 
an Ncol site (in bold) surrounding the ATG initiator codon (double underlined). The second 
amino acid is changed from an aspartic acid to an asparagine residue. This is 



the only change in primary amino acid sequence from the wild type human C5a receptor. 

The PCR product was restricted with Ncol and Xbal (sites in bold) and cloned into 
CADIS 1002 (YEp51Nco). a Gal 10 promoter expression vector. The sequence of the entire 
insert was determined by dideoxy sequencing using multiple primers. The sequence between 
the Ncol and Xbal sites was found to be identical to the human C5a receptor sequence that 
w as deposited in GenBank (accession 3 J05327) w ith the exception of those changes encoded 
by the PCR primers. The C5a receptor-encoding insert was transferred to CADLS 1289 
(pLPXt). a PGK promoter expression vector, using the Ncol and Xbal sites, to generate the 
C5a receptor yeast expression clone. CADI'S 1303. 

A version of the C5a receptor which contains a yeast invertase signal sequence and a 
myc epitope tag at its amino terminus was expressed in Cadus 1 270-transferred yeast under 
control of a GAL10 promoter. Plasmids encoding an untagged version of the C5a receptor 
and a myc-tagged derivative of FL'Sl served as controls. The expression of the tagged 
receptor in yeast was confirmed by Western blot using the anti-myc monoclonal antibody 
9E10. In the lane containing the extract from the Cadus 1 270-transformant. the protein that is 
reactive with the anti-myc monoclonal antibody 9EI0 was approximately 40 kD in size, as 
expected. Note that this receptor construct is not identical to the one used in the autocrine 
activation experiments. That receptor is not tagged, does not contain a signal sequence and is 
driven by the PGR promoter. 

Expression of the Ligand, C5a 

A synthetic construct of the sequence encoding C5a was obtained from C. Gerard 
(Harvard Medical School. Boston. MA). This synthetic gene had been designed as a FLAG- 
tagged molecule for secretion from E. colt (Gerard and Gerard 1990). 

The C5a coding region, still containing E coli codon bias, was amplified using VENT 
polymerase (New England Biolabs Inc., Beverly MA) through 30 cycles using the following 
primers: 



C5a5* = CCCCTTA AGCGTGAGGCAG A AGCT ACTCTGCA AA AGAAGATC 
NO:64) and 

C5a3* - GAAGATCTTCA GCGGCCGAGTTGCATGTC (SEP ID NO:65) 



(SEQ ID 



A PCR product of 257 bp was gel isolated, restricted with Aflll and BgllK and cloned 
into CADUS 1215 (an expression vector designed to express peptide sequences in the context 
of Mfa) to yield CADUS 1297. The regions of homology to the synthetic C5a gene are 
underlined. The 5' primer also contains pre-pro a-factor sequence. Upon translation and 
processing of the pre-pro a-factor sequence, authentic human C5a should be secreted by yeast 
containing CADUS 1297. The insert sequence in CADUS 1297 was sequenced in both 
orientations by the dideoxy method and found to be identical to that predicted by the PCR 
primers and the published sequence of the synthetic C5a gene (Franke et al. 1988 ). 

Two sets of experiments, aside from the autocrine activation of yeast detailed below, 
demonstrated that CADUS 1297 can be used to express C5a in yeast. 

1. ) C5a was immunologically detected in both culture supernatant and lysed cells 
using a commercially available enzyme-linked immunosorbent assay (ELISA) (Table 3). 
This assay indicated the concentration of C5a in the culture supernate to be approximately 50 
to 100 n\l. In comparison, in data derived from mammalian cells, the binding constant of 
C5a to its receptor is 1 n\l (Boulay et al. 1991 ). 

2. ) C5a expressed in yeast was shown to compete for binding with commercially 
obtained (Amersham Corporation. Arlington Heights. IL). radiolabeled C5a on induced HL60 
cells. 

Activation Of the Pheromone Response Pathway In Autocrine Yeast Expressing the 
Human C5a Receptor and Human C5a 

Activation of the yeast pheromone response pathway through the interaction of C5a 
with the C5a receptor was demonstrated using a growth read-out. The strain used for this 
analysis, CY455 ( MA Ta tbtl - 1 ura3 Ieu2 trpl his3 fus1-HIS3 canl stel4::TRPl ste3*1156). 
contains the following significant modifications. A pheromone inducible HIS3 gene, fusl- 
HIS3. is integrated at the FUS1 locus. A hybrid gene containing sequence encoding the first 
41 amino acids of GPA1 (the yeast Ga subunit) fused to sequence encoding human Gai2a 



(minus codons for the N-terminal 33 amino acids) replaces GPA1 at its normal chromosomal 
location. The \east STEM gene is disrupted to lower the basal level of signaling through the 
pheronione response pathway. The \east a-factor receptor gene. STE3. is deleted. The last 
two modifications are probably not essential, but appear to impro\ e the signal-to-noise ratio. 

CY455 ( AfA To. tbtl-1 ura3 leu2 trpl his3 fusl-HlS3 canl stel4::TRPl ste3*1156) 
was transformed with the following plasmids: 

Cadus 1289 - Cadus 1215 = Receptor Ligand" - (R-L-) 
Cadus 1303 - Cadus 1215 - Receptor* Ligand = (R-L-) 
Cadus 1280 - Cadus 1297 - Receptor" Ligand* - (R-L-) 
Cadus 1303 - Cadus 1297 - Receptor' Ligand* - (R-L-) 
Receptor refers to the human C5a receptor. 
Ligand refers to human C5a. 

Three colonies were picked from each transformation and grown overnight in media 
lacking leucine and uracil, at pH 6.8 with 25 m\l PIPES (LEU' URA pH6.8 with 25 mM 
PIPES). This media was made by adding 0.45 ml of sterile 1 M KOH and 2.5 ml of sterile 1 
M PIPES pH 6.8 to 100 ml of standard SD LEU URA" media. After overnight growth the pH 
of this media is usually acidified to approximately pH 5.5. Overnight cultures were washed 
once with 25 mM PIPES pH 6.8 and resuspended in an equal volume of media lacking 
leucine, uracil and histidine (LEU URA HIS pH6.8 with 25 mM PIPES). The optical 
density at 600nm of a 1 20 dilution of these cultures was determined and the cultures were 
diluted into 25 mM PIPES pH 6.8 to a final OD Nlh of 0.2. A volume (5 u.1) of this dilution 
equivalent to 10,000 cells was spotted onto selective (HIS' TRP pH6.8 ImM 
aminotriazole) or non-selective (HIS* TRP" pH6.8) plates. Only those strains expres-sing 
both C5a and its receptor (R+L+) show growth on the selective plates which lack histidine. 
All test strains are capable of growth on plates containing histidine. The R^L+ strain will 
grow on plates containing up to 5 mM aminotriazole. the highest concentration tested. 
For verification of pheromone pathway activation and quanti-fication of the stimulation, the 
activ ity of the fusl promoter was determined colorometrically using a fusl-lacZ fusion in a 
similar set of strains. CY878 ( MATcc tbtl-1 fusl-HIS3 canl stel 4::trpl ::LYS2 ste3* 1 156 



gpal(4 1 >-Gcti2 ) was used as the starting strain for these experiments. This strain is a trpl 
derivative of CY455. The transformants for this experiment contained CADLS 1584 
I pRS424-fus 1 -lacZ } in addition to the receptor and ligand plasmids. Four strains were grown 

overnight in SD LEU L'RA TRP pH6.8 with 50mM PIPES to an OD of less than 0.8. 

Assay of p-galactosidase activity (Guarente 1 983 ) in these strains \ields the data shown in 
Figure 10. 

Projected I ses of the Autocrine C5a Strains: 

A primary use of the autocrine C5a strains w ill be in the discover) of C5a antagonists. 
Inhibitors of the biological function of C5a would be expected to protect against tissue 
damage resulting from inflammation in a wide variety of inflammatory disease processes 
including but not limited to: respiratory distress syndrome (Duchateau et al. 1984: 
Hammerschmidt et al. 1980). septic lung injury (Olson et al. 1985), arthritis (Banerjee et al. 
1989), ischemic and post-ischemic myocardial injury (Weisman 1990: Crawford et al. 1988) 
and burn injury (Gelfand et al. 1982). 

The autocrine C5a system as described can be used to isolate C5a antagonists as 
follows: 

1 . High t h roughput screens to identify antagonists of C5a. 

A straightforward approach involves screening compounds to idemif> those which 
inhibit growth of the R-L^ strain described above in selective media but w hich do not inhibit 
the growth of the same strain or of a R^L- strain in non-selective media. The counterscreen 
is necessary to eliminate from consideration those compounds which are generally toxic to 
yeast. Initial experiments of this type have led to the identification of compounds with 
potential therapeutic utility. 



2. Identification of antagonists usirm negative selection. Replacement of the fusl-MIS3 
read-out with one of several negative selection schemes (fus!-L'RA3 FOA. fusl- 
GAL1 galactose or deoxygalactose. FAR1 sst2 or other mutations that render yeast 
supersensitive for growth arrest) would generate a test system in which the presence of an 
antagonist would result in the growth of the assay strain. Such an approach would be 
applicable to high-throughput screening of compounds as well as to the selection of 
antagonists from random peptide libraries expressed in autocrine yeast. Optimization of 
screens of this type would involve screening the R*L* strain at a concentration of amino- 
triazole which ablates growth of the R^L- strain (we are currently using 0.6 to 0.8 mM) and 
counterscreening the R-L- strain at a concentration of aminotriazole which gives an identical 
growth rate (we are using 0.14 mM ). In addition, the system could employ one of several 
colorometric. fluorescent or chemiluminescent readouts. Some of the genes which can be 
fused to the fusl promoter for these alternate read-outs include lacZ (colorometric and 
fluorescent substrates), glucuronidase (colorometric and fluorescent substrates), phosphatases 
(e.g. PH03. PH05, alkaline phosphatase; colorometric and chemi-uminescent substrates), 
green protein (endogenous fluorescence), horse radish peroxidase (colorometric). luciferase 
( chemiluminescence). 

The autocrine C3a strains have further utility as follows: 

3. in the identification of novel C5a agonists from random peptide libraries expressed in 
autocrine yeast. 

Novel peptide agonists would contribute to structure function analyses used to guide 
the rational design of C5a antagonists. 

4. In the identification of receptor mutants. 

Constitutively active, that is, ligand independent, receptors may be selected from 
highly mutagenized populations by growth on selective media. These constitutively active 
receptors may have utility in permitting the mapping of the sites of interaction between the 
receptor and the G-protein. Identifica-tion of those sites may be important to the rational 
design of drugs to block that interaction. In addition, receptors could be selected for an ability 
to be stimulated by some agonists but not others or to be resistant to antagonist. These 



\ariant receptors would aid in mapping sites of interaction between receptor and agonist or 
antagonist and would therefore contribute to rational drug design efforts. 

5. In the identification of molecules that interact w ith Gai2. Compounds or peptides 

which directly inhibit GDP exchange from Gai2 would have the same effect as C5a 
antagonists in these assays. Additional information would distinguish inhibitors of GDP 
exchange from C5a antagonists. This information could be obtained through assays that 
determine the following: 

1 inhibition by test compounds of Gcxi2 activation from other receptors, 

2 failure of test compounds to compete w ith radiolabeled C5a for binding to the C5a 
receptor, 

3 failure of test compounds to inhibit the activation of other Get subunits by C5a, and 

4 inhibition by test compounds of signalling from constitutively active versions of 
C5a, or other, receptors. 

Example 1 1. Construction of Hybrid Get Genes 

Construction of two sets of chimeric yeast/mammalian Got genes, GPA jr Ga and 
GPAl Bam -Gct. 

The Got subunit of heterotrimeric G proteins must interact with both the py complex and the 
receptor. Since the domains of Get required for each of these interactions have not been 
completely defined and since our final goal requires Got proteins that communicate with a 
mammalian receptor on one hand and the yeast py subunits on the other, we desired to derive 
human-yeast chimeric Get proteins with an optimized ability to perform both functions. From 
the studies reported here we determined that inclusion of only a small portion of the amino 
terminus of yeast Get is required to couple a mammalian Get protein to the yeast py subunits. 
It was anticipated that a further benefit to using these limited chimeras was the preservation 
of the entire mammalian domain of the Get protein believed to be involved in receptor contact 
and interaction. Thus the likelihood that these chimeras would retain their ability to interact 
functionally with a mammalian receptor expressed in the same yeast cell was expected to be 
quite high. 
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Plasmid constructions. 

pRS4\ 6-G PA I (Cadus 1069). An Xbal - Sad fragment encoding the entire GPA I 
promoter region, coding region and approximately 250 nucleotides of 3' untranslated region 
was excised from YCplac 1 1 1 -G7\ J / (from S. Reed. Scripps Institute) and cloned into YEp 
vector pRS416 (Sikorski and Hieter. Genetics 122: 19 ( 1989)) cut with Xbal and Sad. 

Site-directed mutagenesis of GPAl (Cadus 1075, 1121 and 1122). A 1.9 kb EcoRI 
fragment containing the entire GPAl coding region and 200 nucleotides from the 5' 
untranslated region was cloned into EcoRI cut, phosphatase-treated pALTER-1 (Promega) 
and transformed by electroporation (Biorad Gene Pulser) into DH5ctF' bacteria to yield Cadus 
1075. Recombinant phagemids were rescued with M13KG7 helper phage and single stranded 
recombinant DNA was extracted and purified according to the manufacturer's specifications. 
A new Ncol site was introduced at the initiator methionine of GPAl by oligonucleotide 
directed mutagenesis using the synthetic oligonucleotide: 

5* G A T A T A TT A A G GT A G G A A A CCATGGG GTGT A C AG TG AG 3' (SEP ID NO:66) . 
Positive clones were selected in ampicillin and several independent clones were sequenced in 
both directions across the new Ncol site at +1. Two clones containing the correct sequences 
were retained as Cadus 1121 and 1 122. 

Construction of a CPA /-based expression vector (Cadus 1127). The vector used 
for expression of full length and hybrid mammalian Ga proteins in yeast, Cadus 1 127. was 
constructed in the following manner. A 350 nucleotide fragment spanning the 3' untranslated 
region of GPAl was amplified with Taq polymerase (AmpliTaq; Perkin Elmer) using the 
oligonucleotide primers A (5' CGAGQGCTCGAGGGAACGTATAATTAA AGTAGTG 3' 
(SEP ID NO:67> ) and B (5' GCGCGGTACCA AGCTTQA ATTCG AG ATA ATACCC 3' 
(SEP ID N'G:68) ). The 350 nucleotide product was purified by gel electrophoresis using 
GeneClean II (BiolOl) and was cloned directly into the pCRII vector by single nucleotide 
overlap TA cloning ( InVitrogen). Recombinant clones were characterized by restriction 
enzyme mapping and by dideoxynucleotide sequencing. Recombinant clones contained a 
novel Xhol site 5' to the authentic GPAl sequence and a novel Kpnl site 3' to the authentic 
GPAl sequence donated respectively by primer A and primer B. 

The NotI and Sad sites in the polyl inker of Cadus 1013 (pRS414) were removed by 
restriction with these enzymes followed by filling in with the Klenow fragment of DNA 
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polymerase I and blunt end ligation to yield Cadus 1 002. The 1.4 kb Pstl - EcoRI 5' fragment 
ofdPAl from YCplacI 1 \~OPAI containing the OP A I promoter and 5' untranslated region of 
(iPA\ was purified by gel electrophoresis using GeneClean (BiolOl ) and cloned into Pstl - 
EcoRI restricted Cadus 1013 to yield Cadus 1087. The PCR amplified Xhol - Kpnl fragment 
encoding the 3" untranslated region of CPA I was excised from Cadus 1089 and cloned into 
Xhol - Kpnl restricted Cadus 1087 to yield Cadus 1092. The Notl and Sacl sites in the 
pol\ linker of Cadus 1092 were removed by restriction with these enzymes, filling in w ith the 
Klenow fragment of DNA polymerase I, and blunt end ligation to yield Cadus 1110. The 
region of Cadus 1 122 encoding the region of (jPAI from the EcoRI site at -200 to ^ 1 20 w as 
amplified with Vent DNA polymerase (New England Biolabs. Beverly. MA) with the primers 
5' CCCGA ATCCACCAATTTCTTTACG .V <SEQ ID NO:69) and 
5' GCGGCGTCGACGCGGCCGCGTAACAGT 3' (SEQ ID NO:70) . 

The amplified product, bearing an EcoRI site at its 5' end and novel Sacl. Notl and Sail sites 
at its 3' end was restricted with EcoRI and Sail, gel purified using GeneClean II ( Biol 0 1 ). and 
cloned into EcoRI and Sail restricted Cadus 1 1 10 to yield Cadus I 127. The DNA sequence 
of the vector between the EcoRI site at -200 and the Kpnl site at the 3' end of the 3' 
untranslated region was verified by restriction enzyme mapping and dideoxynucleotide DNA 
sequence analysis. 
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PCR amplification of GPA 4; -Gcx proteins and cloning into Cadus 1127. cDNA 
clones encoding the human G alpha subunits Gas. Gai2. Gai3. and .V cerevisiae (ilWI were 
amplified with Vent thermostable polymerase (New England Bioloabs. Beverly. MA). The 
primer pairs used in the amplification are as follows: 

GctS Primer 1 : 5TTGCTGG AGCTCCGCCTGCTGCTGCTGGGTGCTGGAG3' (Sac I 
5' USEQIDNQ71> 
Primer 2 

5'CTGCTGGTCG ACGCGGCCGCGGGGGTTCCTTCTTAGAAGCAGC3" (Sail 3' ) (SEQ 
ID NO:72) 

Primer 3 : 5'GGGCTCGAGCCTTCTTAGAGC AGCTCGTAC3 ' (Xhol 3') (SEQ ID 

NQ:73) 

Gai2 Primer 1 : 5*CTGCTGGAGCTCAAGTTGCTGCTGTTGGGTGCTGGGG3* (Sad 
5') (SEQ ID NO 74) 
Primer 2 : 

5 CTGCTGGTCGACGCGGCCGCGCCCCTCAGAAGAGGCCGCGG TCC3* (Sail 3") 
(SEQ ID NO:75) 

Primer 3 : 5 GGGCTCGAGCCTCAGAAGAGGCCGCAGTC3" (Xhol 3') (SEQ ID 

\Q:76) 

Gai3 Primer 1 : 5"CTGCTGGAGCTCAAGCTGCTGCTACTCGGTGCTGGAG3* (Saci 
5 ' KSEQ ID NO:77) 
Primer 2 : 

5"CTGCTGGTCGACGCGGCCGCCACTAACATCCATGCTTCTCAATAAAGTC3* (Sail 
3' KSEQ ID NO:78) 

Primer 3 : 5'GGGCTCGAGCATGCTTCTCA ATAAAGTCCAC3' (Xhol 3') (SEQ 
IDNO:79) 

After amplification, products were purified by gel electrophoresis using GeneClean II 
(BiolOl) and were cleaved with the appropriate restriction enzymes for cloning into Cadus 
1 127. 



The h> brid GPA, -G J subunits were cloned via a Sac! site introduced at the desired 
position near the 5' end of the amplified genes and a Sal! or Xhol site introduced in the 3' 
untranslated region. Ligation mixtures were electroporated into competent bacteria and 
plasmid DNA was prepared from 50 cultures of ampicillin resistant bacteria. 

Construction of Integrating Vectors Encoding GPA 41 -G a Subunits. The coding region of 
each GPA 4[ -Ga hybrid was cloned into an integrating vector (pRS406 = CRA3 AmpR) using 
the BssHII sites flanking the polylinker cloning sites in this plasmid. Cadus 101 1 (pRS406) 
was restricted with BssHII. treated with shrimp alkaline phosphatase as per the 
manufacturer's specifications, and the linearized vector was purified by gel electrophoresis. 
Inserts from each of the GPA 4l -Got hybrids were excised with BssHII from the parental 
plasmid. and subcloned into gel purified Cadus 101 1 . 

Construction of GPA Bim -Ga Constructs. A novel Bam HI site was introduced in 
frame into the GPA1 coding region by PCR amplification using Cadus 1179 (ercoding a 
wildtype GPA1 allele with a novel Ncol site at the initiator methionine) as the template, 
VENT polymerase. and the following primers: Primer A = 5" 
GCATCCATCA ATA ATCCAG 3* (SEP ID NO:80> and Primer B = 5' G A A A C A A T G G A - 
TCCACTTCTTAC 3' (SEQ ID NO:81 ) . The 1.1 kb PCR product was gel purified with 
GeneClean II (BiolOl ). restricted with Ncol and BamHI and cloned into NcoI-BamHI cut and 
phosphatased Cadus 1 122 to y ield Cadus 1605. The sequence of C adus 1605 w as verified b\ 
restriction analysis and dideoxy-sequencing of double-stranded templates. Recombinant 
GPA Bam -Ga hybrids of Gas, Gai2, and Gal 6 were generated. Construction of Cadus 1855 
encoding recombinant GPA B ^ m -Ga 16 serves as a master example: construction of the other 
hybrids followed an analogous cloning strategy. The parental plasmid Cadus 1617. encoding 
native Gal 6. was restricted with Ncol and BamHI. treated with shrimp alkaline phosphatase 
as per the manufacturer's specifications and the linearized vector was purified by gel 
electrophoresis. Cadus 1605 was restricted with Ncol and BamHI and the 1.1 kb fragment 
encoding the amino terminal 60% of GPA1 with a novel BamHI site at the 3* end was cloned 
into the Ncol- and BamHI-restricted Cadus 1617. The resulting plasmid encoding the 
GPA Bim -Ga 16 hybrid was verified by restriction analysis and assayed in tester strains for an 
ability to couple to yeast Gpy and thereby suppress the gpaf null phenotype. Two additional 



GPA H ^, r -Ga h\brids. GPA^-Gas and GPA Bj:T1 -Gai2. described in this application were 
prepared in an analogous manner using Cadusl606 as the parental plasmid for the 
construction of the GPA bani -Ga i2 hybrid and Cadus 1181 as the parental plasmid for the 
construction of the GPA Hjni -Ga s hybrid. 

Coupling by chimeric Ga_proteins._The Ga chimeras described above were tested 
for the ability to couple a mammalian G protein-coupled receptor to the pheromone response 
pathway in yeast. The results of these experiments are outlined in Table 5. Results obtained 
using GPA 1 4i -Gai2 to couple the human C5a receptor to the pheromone response pathway in 
autocrine strains of yeast are disclosed in Example 10 abov e. 



Example 12 

Screening for Modulators of G-alpha activity 

Screens foi modulators of Ga activity may also be performed as shown in the following 
examples for illustration purposes, which are intended to be non-limiting. 

The strains used in this experiment are characterized in Table 9. Strains CY4874 and 
CV4877 arc isogenic but for the presence of Q205L mutation in the cloned Ga, : gene cloned 
hue plasmid 1. Strains CY4Q01 and CY4904 each have a chromosomally integrated chimeric 
Ga fusion comprising 41 amino acids of gpal at the N terminus of the human Ga, : gene and 
are isogenic but for the presence of a constitutively activating mutation in the C5a receptor 
gene of CY4901 . Strain CY5058 is a gpal mutant which carries only the yeast Gpy subunits 
and no Ga subunit. This strain is a control strain to demonstrate specificity of action on the 
Ga subunit. 



1. Suppression of Activation by .Mutation of Ga 



The Q205L mutation is a constitutive!) activated GTPase deficient mutant of the human Ga . 
gene. Antagonist compounds, chemicals or other substances which act on Gct ;: can be 
recognized by their action to reduce the level of activation and thus reduce the signal from the 
fus 1 -lacZ reporter gene on the second plasmid ( Plasmid 2 ). 



A. GTPase Ga l2 Mutants 

test component = gpa 4 -Ga : (Q : ,L) 

control component ■= gpa 41 -Ga, : 
As well as the CY4874 and CY4877 constructs detailed above, similar strains with fusl-His3 
or fus2-CANl growth readouts may also be used. The fus!-His3 strains are preferred for 
screening for agonists and the fus2-CAN 1 strains are preferred for antagonist screens. 



Readout test strain effect of Ga c control strain 

antauonist 

fust-HIS3 CY4868 inhibit growth on - CY4871 

HIS 

- AT( Aminotriazole) 
fusl-lacZ CV4874 reduce (3-gal activity CY4877 

fus2-CANl CY4892 induce growth on CY4386 

canavanine 
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In each case an antagonist should cause the test strain to beha\e more like the control strain. 



B. GTPase Ga s Mutants (Got Specificity) 

test component = Ga s (Q :: L» 
control component = Gct s 



Readout test strain effect of Ga ^ control strain 

antagonist 

fusl-HIS3 CY4880 none CY4883 

fusl-lacZ CY4886 none CY4889 

fus2-CAM CY4895 none CY4898 
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In each case a non-specific antagonist would cause the test strain to behave more like the 

control strain. 

Additional media requirements: -TRP for Get plasmid maintenance in fusl-HIS3 and fus2- 
CAN1 screens and -TRP -I RA for Ga and fusl-lacZ plasmid maintenance in fusl-lacZ 

screen. 
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II. Suppression of Activation by Receptors 

Constitutive!}' Activated C 5a Receptors 

test component - C5aR* (P X4 L. activated C5a Receptor) 

control component = C5aR 
The C5aR* mutation has a Leucine residue in place of the Proline residue of the w ild-type at 
position 184 of the amino acid sequence. 



Readout test strain effect of Ga. ^ control strain 

antagonist 

fusl-HIS3 CY4029 inhibit growth on - CY2246 

HIS 

~AT( Aminotriazole) 
fusl-lacZ CY4901 reduce P-gal activity CY4904 

fus2-CAM CY4365 induce growth on CY4362 

canavanine 



In each case an antagonist should cause the test strain to behave more like the control strain. 
Additional media requirements: -LEU for receptor plasmid maintenance in fusl-HIS3 and 
fus2-CANl screens and -LEU-URA for receptor and fusl-lacZ plasmid maintenance in fusl- 
lacZ screen, non-buffered yeast media (pH 5.5). 
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TABLE 1:ABC TRANSPORTERS* 



Species 


System 


Substrate 


Bacteria 


Salmonella 
typhi muri um 


OppABCDF 


Oligopeptides 


Streptococcus 

^JXJ cr LLtnO Jj J. <3 e 


AmiABCDEF 


Oligopeptides 


Bacillus 
subtilis 


Opp (SpoOK) 


Oligopeptides 


K. COljL 


Dpp 


Dipeptides 


Bacilus subtilis 


DciA 


Dipeptides 


S . typhi murium 


His JQMP 


Histidine 


E. coli 


HisJQMP 


Histidine 


E. coli 


MalEFGK 


Maltose 


S , typhi muri um 


MalEFGK 


Maltose 


En teroba cter 
aerogenes 


MalEFGK 


Maltose 


E. coli 


UgpABCE 


sn- Glycerol -3 - 
phosphate 


E. coli 


AraFGH 


Arab i nose 


E. coli 


RbsACD 


Ribose 


E. coli 


GlnHPQ 


Glutamine 


S. typhi muri um 


ProU (VWX) 


Glycine- be taine 


E. coli 


PrnTT ( VWY \ 


Glycme-betaxne 


E. coli 




Leucine- 

i so leucine- valine 


E. coli 


PstABC 


Phosphate 


Pseudomona s 


NosDYF 


Copper 


E. coli 


chijn 


iuu -Lyoaenuui 


E. coli 


CysPTWAM 


Sulphate- 
Thiosulf ate 


E. coli 


BtuCDE 


Vitamin B12 


E . coli 


FhuBCD 


Fe 3+ - f errichrome 


E. coli 


FecBCDE 


Fe 3+ -dicitrate 


S. marcescens 


Sf uABC 


Fe 3 + 


Mycoplasma 


p37, 29, 69 


? 

— — 



131 





E. coli 


Phn/Psi 


AiJcyi -pnospnonates 
(?) 




S trep tomyces 
peuce tius 


DrrAB 


Daunomycin/Doxorub 
icin 


5 


StireZDtOTTTVCf* <=; 




Tylosin 




Staphylococcus 


MsrA 


Erythromycin 
resistance 




Agroba. c teri urn 
tumefaciens 


OccJQMP 


Octopine 




E. coli 


HlyB 


Haemolysin 


10 


Pasturella 


LtkB 


leukotoxin 




coli 


CvaB 


Colicin V 




Erwinia 
chjrysan th emi 


XT J- l_ U 


Proteases 


15 


Bordetella 
pertussis 


CyaB 


Cyclolysin 




Streptococcus 
pneumoniae 


ComA 


Competence factor 




Rhizobium meliloti 


NdvA 


0-1,2 -glucan 


20 


A CTT~ C^i "hi r~* f~ ~y~ i Tim 
■"■j J- (^JUd U C- X J. LLLil 

tumefaci ens 


LaVA 


0-1,2 -glucan 




i icicanw^jiij. J. u o 

influenzae 


B exAB 


Capsule 

polysaccharide 




coli 


KpsMT 


Capsule 

polysaccharide 




Wiesseria 


CrtCD 


Capsule 

polysaccharide 


i ^ 
z. 


S. coli 


FtsE 


Cell division 




E. coli 


UvrA 


DNA repair 




Rhizobium 

1 egiiminosarum 


Nodi 


Nodulation 




Rhizobium meliloti 


0FR1 


? 


30 


Cyanobac teria 




Anabaena 


HetA 


Differentiation 




Synchococcus 


CysA 


Sulphate 




Yeast 




S. cerevisiae 


STE6 


a-mating peptide 




S. cerevisiae 


ADP1 


? 
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S. cerevisiae 


EF-3 


Translation 


Protozoa 


Plasmodium 


pfMDR 


Chloroquine 


Lieshmania 


ltpgpA 


Methotrexate /heavy 
metals 


Insect 


Drosophila 


whi te -brown 


Eye pigments 


Drosophila 


Mdr4 9 
Mdr6 5 


Hydrophobic drugs? 
? 


Plants 


Liverwort 
chloroplas t 


MbpX 


? 


Animals 


Man 


CFTR 


Chloride 


Mouse 


CFTR 


Chloride 


Xenopus 


CFTR 


Chloride 


Cow 


CFTR 


Chloride 


Dogfish 


CFTR 


Chloride 


Man 


MDR1 
MDR3 


Hydrophobic drugs 
? 


Mouse 


MDR1 
MDR2 


Hydrophobic drugs 
? 




MDR3 
MDR3 


Hydr ophob i c drugs 
Hydrophobic drugs 


Hamster 


Pgpl 
Pgp2 
Pgp3 


Hydrophobic drugs 
Hydrophob i c drugs 
? 


Man 


PMP7 0 


Polypeptides? 


Man 


RING4-11 
PSF1-PSF2 


Peptides 
Peptides 


Mouse 


HAM1-HAM2 


Peptides 


Rat 


Mtpl 


Peptides 



25 



+ Adapted from Higgins, C.F. 1992. ABC Transporters: Fi 
Microorcanizmz to Kan. Annu . Rev. Cell Biol. 8, 67-113 
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TABLE 2: HUMAN G PROTEIN- COUPLED SEVEN TRANSMEMBRANE 
RECEPTORS: REFERENCES FOR CLONING 



5 



10 



15 



Receptor 


Reference 


or 1A - adrenergic receptor 


Bruno et al . (1991) 


a, B - adrenergic receptor 


Ramarao et al . (1992) 


a 2 - adrenergic receptor 


Lomasney et al (1990) 


a 7B - adrenergic receptor 


Weinshank et al. (1990) 


/?j - adrenergic receptor 


Frielle et al . (1987) 


p 2 - adrenergic receptor 


Kobilka et al . (1987) 


(3 3 - adrenergic receptor 


Regan et al . (1988) 


rr^AChR, m2 AChR , m 3 AChR , 
n\; AChR 


Bonner et al. (1987) 
Peralta et al . (1987) 


m5 AChR 


Bonner et al . (1988) 


D! dopamine 


Dearry et al . (1990) 
Zhou et al . (1990) 
Sunahara et al . (1990) 
Weinshank et al . (1991) 


D 2 dopamine 


Grandy et al . (1989) 


D 3 dopamine 


Sokolof f et al. (1990) 


D 4 dopamine 


Van Tol et al . (1991) 


D 5 dopamine 


M. Caron (unpub . ) 

We i ns hank et a 1 . (1991) 


Al adenosine 


Libert et al . ( 1992 ) 


adenosine A2b 


Pierce et al . ( 1992 ) 


5-HTla 


Kobilka et al. (1937) 
Fargin e: a 1 . (1953) 
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5-HTlb 


Hamblin et al. (1992) 
Mochizuki et al . (1992) 




5HTl-like 


Levy et al . (1992a) 




5 -HTld 


Levy et al . (1992b) 




5HTld-like 


Hamblin and Metcalf (1991) 


5 


5 HTld beta 


Demchyshyn et al . (1992) 




substance K (neurokinin A) 


Gerard et al . (1990) 




substance P (NK1) 


Gerard, et al . (1991) ; 
-Ld-rvtiu.ct tr i_ ax. t iy y i j 




f -Met -Leu- Phe 


ouuiay trU al . i iy y U J 

Murphy & McDermott (1991) 
DeNardin et al . (1992) 




angiotensin II type 1 


Furuta et al . (1992) 


10 


mas proto- oncogene 


Young et al . (1986) 




endothelin ETA 


Aiciyztix ec ai . (iyy2) 
Hosoda et al . (1991) 




endothel in ETB 


Nakamuta et al . (1991) 

wy awa t: L- aJL. 1 1 y y 1 ) 




t~ t~ m V^l "5 n 


Vu et al . (19 91) 


15 


growth hormone - releasing 
hormone (GHRH) 


Mayo (19 92) 




vasoactive intestinal 


Sreedharan et al . (1991) 




peptide (VIP) 






oxytocin 


Kimura et al . , (1992) 


20 


somatostatin SSTR1 and 
SSTR2 


Yamada et al . (1992a) 



135 



SSTR3 


Yamada et al . (1992b) 


cannabinoid 


Gerard et al . (1991) 


follicle stimulating 
hormone (FSH) 


Minegish et al . (1991) 


LH/CG 


Mineqish et al mqqo^ 


thyroid stimulating 
hormone (TSH) 


Naqavama et si mqaq\ 
Libert et al . (1989) 
Misrahi et si mqqh\ 


thromboxane A2 


Hirata et al . (1991) 


pla t el et - ac t ivat ing factor 
(PAF) 


Kunz et al, (1992) 


S Tl A V"l \r 1 :3 -i -m 
^ Ja aiia|jny XaLUAin 


Boulay et al . (1991) 
Gerard and Gerard (1991) 


Interleukin 8 (IL-8) IL- 
8RA 


Holmes et al. (1991) 


IL-8RB 


Murohv and Tt f fnnv mqq-i\ 


Delta Opioid 


Evans et al . (1992) 


Kappa Opioid 


Xie et al. (1992) 


mip- l/RANTES 


Neote et al . ( 1993 ) 
Murphy et al . , in nr^^ 


Rhodopsin 


Nathans and Hogness (1984) 


Red opsin, Green opsin, 
Blue opsin 


Nathans, et al . (1986) 


met abet ropic glutamate 
mGluRl - 6 


Tanabe et al . (1992) 


hi s tanine H2 


Gantz et al . (1991) 
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ATP 


Julius, David (unpub.) 


neuropeptide Y 


Herzog et al . (1992) 
Larhammar et al. (1992) 




amyloid protein precursor 


Kang et al . (1987) 
Mita, et al . (1988) 
Lemaire et al . (1989) 


insulin-like growth factor 
II 


Kiess et al . (1988) 


bradykinin 


Hess et al . (1992) 


gonadotropin- releasing 
hormone 


Chi et al. (1993) 


cholecystokinin 


Pisegna et al . (1992) 


melanocyte stimulating 
hormone receptor 


Chhajlane et al . (1992) 
Mountjoy et al . (1992) 


antidiuretic hormone 
receptor 


Birnbaumer et al . (1992) 


glucagon receptor 


Sprecher et al. (1993) 


adrenocorticotropic 
hormone II 


Mountjoy et al . (1992) 
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Table 3. Detection of C5a production in yeast by ELISA. 



R-L- R + L- R-T.+ R + L + 



[C5a] in culture n.d.n.d. 0.64 ng/ml 0.5 ng/ml 

= 77 nM =60 nM 
[C5a] released n.d.n.d. 0.8 ng/ml 0.6 ng/ml 
from lysed cells* =97 nM =73 nM 

5a was detected by enzyme - linked immunosorbent assay (ELISA) . 
Molar concentrations were calculated using MW=8273 as 
predicted by C5a sequence. 
* Determined by pelleting cells, resuspending cells in the 
>riginal volume, breaking yeast with glass beads and assaying 

the resulting supernatant. 
n.d.=not done 



1 
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Table 4. Coupling of the C5a receptor to Gcx chimeras in 

yeast . 

ChimeraExpres sionResnlh 
Context 

GPA1 41 -Gai2single copy, Good signal to noise ratio: 
integrated, efficient coupling to yeast 
GPA1 promoter/^ . 
GPAl 41 -Gori3single copy, Poor signal to noise ratio: 
integrated, high background due to poor 
GPAl promotercoupling to yeast (3y , high 

LIRMA* . 

GPAl/3am-Gai21ow copy plasmid, Signal equal to that with 
GPA 1 promoterGPAl 41 -Gai2 , however, back- 
ground is greater. 
GPAl/?am-Gafl61ow copy plasmid, Poor signal to noise ratio, 
GPAl promoterhigh background due to poor 
coupling to yeast (3y , high 
LIRMA* . 

GPAl£am-Gc*slow copy plasmid, Unacceptably high background 
GPAl promoterdue to poor coupling to 
yeast /?' , high LIRMA* . 

*LIRMA = Ligand Independent Receptor Mediated Activation. 
With this phenomenon, there is an increase in growth on 
selective media for strains containing heterologous receptor 
in the absence of ligand. It is possible that some receptor 
antagonists would decrease LIRMA . It has been noted (Milano, 
et al. 1994) that specific antagonists reduce LIRMA of the (32 
adrenergic receptor when that receptor is overexpressed in 

transgenic mice. 
LIRMA may be exploited in several ways, including the 
identification of antagonists capable of reducing the 
phenomenon. A subset of antagonists would be expected no 



affect the receptor conformation in such a way as to prevent 

the downstream signalling that occurs in tl.e absence of 
agonist. LIRMA can be exploited to identify new G protein- 
coupled receptors by expressing cDNA clones in yeast strains 
expressing those chimeric G proteins which couple only poorly 
to yeast /? Y . m addition, LIRMA may permit the identification 
of inhibitors that are specific for G proteins. 
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Table 6. Coupling of Ga switch region hybrids to the 
pheromone response pathway. 



5 



Protein 


GPA1 amino 
acid sequences 


Gas amino acid 
sequences 


Pheno fc vn #=* 


GPA1 


1-472 


none 


Couples with 


GafS 


none 


1-394 


Couples with 
GjSy weakly 


GPA^-S 


1-41 


42-394 


Couples with 
G(3y weakly 


SGS 


297-333 


1-201 + 237- 
394 


Does not 
couple with 
G/3y 


GPA^-SGS 


1-41 + 297-333 


42-201 + 237- 
394 


Couples with 
G/Jy weakly 
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Table 8 

G protein- coupled receptors that couple through Gari 

M2 muscarinic acetylcholine 
M4 muscarinic acetylcholine 

5 adenosine Al 

adenosine A3 

a iA" adrenergic 
a 2B - adrenergic 
o> 2C - adrenergic 

10 bradykinin B 2 

cannabinoid 

D2 dopamine 
D4 dopamine 

ET B endothelin 

15 f ormyl -methionyl peptide receptor FPRj 

me tabo tropic glutamateR 2 
me tabo tropic glutamateR 3 
me tabo tropic glutamateR 4 

5HT 1A (serotonin) 
20 5HT 1B (serotonin) 

5HT 1D (serotonin) 
5HT 1E (serotonin) 
5HT ]F ( serotonin) 

neuropeptide Y 
25 delta opioid 

prostaglandin EP3 

somatostatin 2 
somatostatin 3 
somatostatin 4 

- 0 thrombin 



C5a 
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platelet activating factor 
angiotensin AT! 
angiotensin AT 2 
IL-8 
MCP1A 
MCP1B 
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CLAIMS 

1. A yeast cell having a pheromone system, which 

cell expresses (a) a heterologous surrogate of a yeast 
pheromone system protein, said surrogate, under at least 
some conditions, performing in the pheromone system of 
the yeast cell a function naturally performed by the 
corresponding yeast pheromone system protein, and (b) a 
heterologous peptide, whereby if said peptide modulates 
the interaction of said surrogate with said pheromone 
system, said modulation is a selectable or screenable 
event . 

2. The yeast cell of claim 1 wherein the endogenous 
pheromone system protein is not produced in functional 
form . 

3. The yeast cell of claim 1 wherein the peptide is 
secreted by the cell into the periplasmic space, from 
which it interacts with said surrogate. 

4. The yeast cell of claim 3, wherein the peptide is 
expressed in the form of a precursor peptide comprising a 
cleavable leader peptide and a mature peptide, and the 
leader peptide is substantially homologous to the leader 
peptide of the wild- type pheromone of said cell. 

5. The yeast cell of claim 4 wherein the wild- type 
leader peptide is that of the Saccharomyces cerevisiae a 
factor or a-factor. 

6. The yeast cell of claim 4 in which the wild-type 
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pheromone is not secreted. 

7. The yeast cell of claim 3 wherein the peptide is als 
expressed in a nonsecretory form. 

8. The yeast cell of claim 1 wherein the cell is a 
mutant strain having a reduced propensity, relative to 
the wild- type strain, to have its pheromone signal 
pathway desensitized through repeated or prolonged 
stimulation thereof. 

9. The yeast cell of claim 8 in which the SST2 gene is 
not functionally expressed. 

10. The yeast cell of claim 1, in which the FARl gene is 
not functionally expressed. 

11. The yeast cell of claim 1, further comprising a 
selectable marker that is activated by the pheromone 
signal pathway . 

12. The yeast cell of claim 11, said selectable marker 
comprising a pheromone - responsive promoter which is 
substantially homologous with an endogenous pheromone- 
responsive promoter, operably linked to a foreign 
selectable gene. 

13. The yeast cell of claim 12 wherein the selectable 
gene is an IGP dehydratase gene. 

14. The yeast cell of claim 12 wherein the homologous 
wild-type promoter is the FUS1 promoter. 
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15. The yeast cell of claim l wherein the cells belong 
to the species Saccharomvces cerevisiae . 

16. The yeast cell of claim 1 in which the pheromone 
system protein is a farnesyl transferase . 

17. The yeast cell of claim 1 in which the pheromone 
system protein is a carboxymethyl transferase . 

18. The yeast cell of claim 1 in which the pheromone 
system protein is a kinase. 

19. The yeast cell of claim 1 wherein the yeast 
pheromone system protein is a protease involved in the 
production of the mature form of the yeast pheromone, 
through the cleavage of a precursor protein, and the 
surrogate is also a protease. 

20. The yeast cell of claim 19 wherein the precursor 
protein produced in the cell is itself a surrogate of the 
yeast pheromone precursor protein, and said surrogate 
precursor protein has an amino acid sequence comprising a 
recognition site recognized by the surrogate protease, 
said recognition site differing from that recognized by 
the yeast pheromone system protease, but said surrogate 
precursor protein is cleaved by the surrogate protease to 
produce the mature form of the yeast pheromone. 

21. The yeast cell of claim 20 wherein the wild type 
yeast pheromone precursor protein is not produced. 
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The yeast cell of claim 1 wherein the yeast 
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pheromone system protein is an ABC transporter involved 
in the membrane transport of the yeast pheromone, and the 
surrogate is also an ABC transporter. 

23. The yeast cell of claim 22 wherein the surrogate 
transports the yeast pheromone unless the peptide 
interferes with such transport. 

24. The yeast cell of claim 22 wherein the surrogate 
transports the yeast pheromone only with the aid of said 
peptide . 

25. The yeast cell of claim 1 wherein the yeast 
pheromone system protein is the yeast pheromone receptor. 

26. The yeast cell of claim 25 in which the peptide 
is an agonist for the surrogate receptor. 

27. The yeast cell of claim 25 in which the peptide 
is an antagonist for the surrogate receptor. 

28. The yeast cell of claim 25 wherein the Get subunit 
of the G protein is chimeric. 

29. The yeast cell of claim 28 wherein the amino 
terminal portion of the Go: subunit is substantially 
homologous with the Ga subunit of a yeast G protein and 
the remainder is substantially homologous with the 
corresponding portion of a Get subunit of a heterologous G 
protein . 

30. The yeast cell of claim 1 in which the pheromone 
system protein is a cvclin. 
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31. The yeast cell of claim 30, said yeast cell 
further containing a non-pheromone- responsive screenable 
marker . 

32. A yeast culture comprising a plurality of yeast 
cells according to claim 1, said yeast cells collectively 
expressing a peptide library. 

33. A method of assaying a peptide for modulation of 
the activity of a non-yeast surrogate for a pheromone 
system protein which comprises providing yeast cells 
according to claim 1, which cells functionally express 
said surrogate and said peptide, and determining whether 
the pheromone signal pathway is activated or inhibited by 
said peptide. 

34. The method of claim 33 in which the cells 
comprise a pheromone- responsive selectable marker, and 
cells are selected for expression of a peptide having the 
desired activating or inhibiting effect. 

35. The method of claim 33 in which the cells 
comprise a pheromone - responsive screenable marker, and 
cells are screened for expression of a peptide having the 
desired activating or inhibiting effect. 

36. A method of assaying a peptide library for 
activity of a non-yeast pheromone system protein 
surrogate which comprises providing a yeast culture 
according to claim 32, whose cells each functionally 
express said surrogate and a peptide of said library, 
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said culture collectively expressing the entire peptide 
library, and determining whether the pheromone signal 
pathway is activated or inhibited by said peptides in 
each of the cells of said culture. 

37. The method of claim 34 in which the surrogate is 
human Mdrl , the cells grow on hist idine- free media only 
if the surrogate transports a- factor, the cells are 
galactose-sensitive only if the surrogate transports o>- 
factor, and endogenous pleiotropic drug resistance genes 
have been inactivated. 

38. The yeast cell of claim 25 wherein the surrogate 
receptor is the C5a receptor. 

39. The cell of claim 1 wherein the cognate yeast PSP 
is one involved in the upstream processing of the 
pheromone prior to its interaction with the receptor. 

40. The cell of claim 1 wherein the cognate yeast PSP 
is involved in the post - transit ional modification of the 
pheromone precursor to yield the mature pheromone. 

41. The cell of claim 1 wherein the cognate yeast PSP 
is involved in the secretion or transport of the 
pheromone . 

42. The cell of claim 1 wherein the cognate yeast PSP 
is involved in the downstream transduction of a signal 
received by the pheromone receptor. 
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ABSTRACT OF THE DISCLOSURE 



Yeast cells are engineered to express both a 
surrogate of a pheromone system protein (e.g., enzymes 
involved in maturation of of- factor, transporters of a- 
factor, pheromone receptors, etc.) and a potential 
peptide modulator of the surrogate, in such a manner that 
the inhibition or activation of the surrogate affects a 
screenable or selectable trait of the yeast cells. 
Various additional features improve the signal - to - noise 
ratio of the screening/selection system. 
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